NOTES ON FREE PROBABILITY THEORY 



DIMITRI SHLYAKHTENKO. 



These notes are from a 4-lecture mini-course taught by the 
author at the conference on von Neumann algebras as part of 
the "Geometrie non commutative en mathematiques et physique" 
month at CIRM. 



Contents

Lecture 1. Free Independence and Free Harmonic Analysis.
  1.1. Probability spaces.
  1.2. Non-commutative probability spaces.
  1.3. Classical independence.
  1.4. Free products of non-commutative probability spaces.
  1.5. Free Fock space.
  1.6. Free Central Limit Theorem.
  1.7. Free Harmonic Analysis.
  1.8. Further topics.
Lecture 2. Random Matrices and Free Probability.
  2.1. Random matrices.
  2.2. Asymptotics of random matrices.
  2.3. An application to random matrix theory.
  2.4. Applications to von Neumann algebras.
Lecture 3. Free Entropy via Microstates.
  3.1. Definition of free entropy.
  3.2. Properties of free entropy.
  3.3. Free entropy dimension.
  3.4. Properties of free entropy dimension.
  3.5. Relation with $L^2$-Betti numbers.
Lecture 4. Non-microstates Approach to Free Entropy.
  4.1. A non-rigorous derivation of the non-microstates definition.
  4.2. Properties of $\chi^*$.
  4.3. Non-microstates free entropy dimension.
References






Lecture 1. Free Independence and Free Harmonic Analysis.



Free probability theory was developed by Voiculescu as a way 
to deal with von Neumann algebras of free groups. In addition 
to the view of von Neumann algebras as "non-commutative mea- 
sure spaces", which was already presented in this conference, 
free probability theory considers von Neumann algebras as "non- 
commutative probability spaces". 

There are by now several standard references on free probability
theory, of which we mention two: [VDN92], [Voi00].

1.1. Probability spaces. Recall that a classical probability
space is a measure space $(X, \mathcal{B}, \mu)$. Here $\mathcal{B}$ is a
sigma-algebra of subsets of $X$, and $\mu$ is a probability measure,
i.e., $\mu(X) = 1$. One thinks of $X$ as a set of events, and for
$Y \in \mathcal{B}$, the measure $\mu(Y)$ is the probability of an event
occurring in the set $Y$.

1.1.1. Random variables; laws. An alternative point of view on
probability theory involves considering random variables, i.e.,
measurable functions $f : X \to \mathbb{C}$. One can think of a random
variable as a measurement, which assigns to each event $x \in X$ a
value $f(x)$. Note that the probability of the value of $f$ lying in a
set $A \subset \mathbb{C}$ is exactly $\mu(f^{-1}(A)) = (f_*\mu)(A)$. Thus
the law of $f$, $\mu_f$, defined to be the push-forward measure
$\mu_f = f_*\mu$ on $\mathbb{C}$, measures the probabilities that $f$
assumes various values.
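For a finite probability space the push-forward can be computed directly. The following is a minimal sketch, assuming a finite set $X$ with the measure stored as a dictionary; the function name `law` and the die example are illustrative and not part of the notes.

```python
from collections import defaultdict
from fractions import Fraction

def law(mu, f):
    """Push-forward f_* mu of a finite probability measure mu (dict: point -> mass)."""
    pushed = defaultdict(Fraction)
    for x, mass in mu.items():
        # mu(f^{-1}({v})) collects the mass of every point x with f(x) = v
        pushed[f(x)] += mass
    return dict(pushed)

# Hypothetical example: a fair die and the random variable f(x) = x mod 3.
die = {x: Fraction(1, 6) for x in range(1, 7)}
mu_f = law(die, lambda x: x % 3)
print(mu_f)
```

Each of the three values 0, 1, 2 is attained on two faces of the die, so the law assigns mass 1/3 to each of them.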

1.1.2. The expectation E. Let us say that $f \in L^\infty(X,\mu)$ is an
essentially bounded random variable. Then the integral
\[ E(f) = \int f(x)\, d\mu(x) \]
has the meaning of the expected value of $f$. For this reason, the
linear functional $E : L^\infty(X,\mu) \to \mathbb{C}$ given by integration
against $\mu$ is called an expectation. We note that $E$ satisfies:
$E(1) = 1$ (normalization), $E(f) \ge 0$ if $f \ge 0$ (positivity).

Note that the knowledge of $(X, \mathcal{B}, \mu)$ is equivalent (up to an
isomorphism and up to null sets) to the knowledge of $L^\infty(X,\mu)$ and
$E$. Thus the notion of a classical probability space can be phrased
entirely in terms of commutative (von Neumann) algebras.



1.2. Non-commutative probability spaces. We now play the
usual game of dropping the word "commutative" in a definition:

Definition 1.1. An algebraic non-commutative probability space
is a pair $(A, \phi)$ consisting of a unital algebra $A$ and a linear
functional $\phi : A \to \mathbb{C}$, so that $\phi(1) = 1$.

Thus we think of $a \in A$ as a "non-commutative random variable",
$\phi(a)$ as its "expected value" and so on. Of course, any classical
probability space is also a non-commutative probability space.
But there are many interesting genuinely non-commutative probability
spaces. For example, if $\Gamma$ is a discrete group, we could set
$A = \mathbb{C}\Gamma$ (the group algebra) and $\phi = \tau_\Gamma$ (the
group trace). Here if $g \in \Gamma \subset \mathbb{C}\Gamma$, then
$\phi(g) = 0$ if $g \ne 1$ and $\phi(1) = 1$. The same construction works
with $A$ replaced by the reduced group C*-algebra of $\Gamma$, or the
von Neumann algebra of $\Gamma$.

1.2.1. Positivity. Operator algebras give one a "test" of which
algebraic non-commutative spaces "exist in nature". These are
precisely those non-commutative probability spaces that can be
represented by (possibly unbounded) operators on a Hilbert space $H$,
so that $\phi$ is a linear functional given by a vector-state,
$\phi(a) = \langle h, ah \rangle$ for some $h \in H$. If $A$ is a
*-algebra, it is not hard to characterize these (via the GNS
construction) in terms of the properties of $\phi$: $\phi$ must be
positive, i.e., $\phi(a^*a) \ge 0$ for all $a \in A$.

1.2.2. The law of a random variable. Recall that we assigned to
a classical random variable $f$ its law $\mu_f$. If $A$ is an algebra of
operators on a Hilbert space $H$, $\phi(\cdot) = \langle h, \cdot\, h \rangle$
and $a \in A$ is self-adjoint, then the spectral theorem gives us a
measure $\nu_a$ on $\mathbb{R}$ valued in the set of projections on $H$,
so that
\[ a = \int t\, d\nu_a(t). \]
If we let
\[ \mu_a = \phi \circ \nu_a, \]
then $\mu_a$ is a measure on $\mathbb{R}$. It is not hard to check that if
we are in the classical situation and $a \in L^\infty(X,\mu)$,
$H = L^2(X,\mu)$, $h = 1$, then this construction gives us precisely the
law of $a$.

1.2.3. Moments. However, if $a$ is not self-adjoint, or if we are
dealing with a $k$-tuple of random variables, there is no description
of the law of $a$ in terms of a measure.

Fortunately, for $f \in L^\infty(X,\mu)$ the moments of $f$, i.e., the
expected values $E(f^p)$, $p = 1, 2, \dots$ are exactly the same as the
moments of the law $\mu_f$ of $f$. Indeed,
\[ E(f^p) = \int f(x)^p\, d\mu(x) = \int t^p\, d\mu_f(t) \]
is exactly the $p$-th moment of $\mu_f$. For essentially bounded $f$, the
moments of $\mu_f$ determine $\mu_f$.

Thus given a family $F$ of variables $a_1, \dots, a_n \in A$, we say that
an expression of the form $\phi(a_{i_1} \cdots a_{i_p})$ is the
$i_1, \dots, i_p$-th moment of the family $F$. The collection of all
moments can be thought of as a linear functional $\mu_F$ defined on the
algebra of polynomials in $n$ non-commuting indeterminates
$t_1, \dots, t_n$ by
\[ \mu_F(p) = \phi(p(a_1, \dots, a_n)). \]
This functional $\mu_F$ is called the joint law, or joint distribution,
of the family $F$.

1.3. Classical independence.

Definition 1.2. Two random variables $f$ and $g$ in $L^\infty(X,\mu)$ are
called independent, if
\[ E(f^n g^m) = E(f^n) E(g^m) \]
for all $n, m \ge 0$.

Equivalently, $E(FG) = 0$ whenever $E(F) = E(G) = 0$ and $F$ is
in the algebra $W^*(f)$ generated by $f$, while $G \in W^*(g)$.

The equality $E(fg) = E(f)E(g)$ is a consequence of the statement
that "the probability that the value of $f$ lies in a set $A$ and
the value of $g$ lies in the set $B$ is the product of the probabilities
that the value of $f$ lies in $A$ and the value of $g$ lies in $B$",
which is a more familiar way of phrasing independence.

If $X = X_1 \times X_2$ and $\mu = \mu_1 \times \mu_2$, then any functions
$f$, $g$ so that $f$ depends only on the $X_1$ coordinate and $g$ only on
the $X_2$ coordinate are independent. Note that another way of saying
this is that the random variables $f_1 \otimes 1$ and $1 \otimes g_1$ in
$L^\infty(X_1,\mu_1) \otimes L^\infty(X_2,\mu_2)$ are independent, for any
$f_1 \in L^\infty(X_1,\mu_1)$ and $g_1 \in L^\infty(X_2,\mu_2)$.
Thus independence has to do with the operation of taking tensor
products of probability spaces.
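The product-space description of independence can be checked by hand on a small example. The sketch below assumes two finite probability spaces with made-up masses; the names `mu1`, `mu2`, `f`, `g` and `independent_up_to` are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Two small probability spaces (X1, mu1) and (X2, mu2); the masses are illustrative.
mu1 = {0: Fraction(1, 4), 1: Fraction(3, 4)}
mu2 = {-1: Fraction(1, 3), 0: Fraction(1, 3), 2: Fraction(1, 3)}

# Product measure mu = mu1 x mu2 on X = X1 x X2.
mu = {(x1, x2): p1 * p2
      for (x1, p1), (x2, p2) in product(mu1.items(), mu2.items())}

def E(h):
    """Expectation of h with respect to the product measure."""
    return sum(h(x) * mass for x, mass in mu.items())

f = lambda x: 2 * x[0] + 1   # depends only on the X1 coordinate
g = lambda x: x[1] ** 2 - 1  # depends only on the X2 coordinate

def independent_up_to(order):
    """Check E(f^n g^m) = E(f^n) E(g^m) for all 0 <= n, m <= order."""
    return all(
        E(lambda x: f(x) ** n * g(x) ** m)
        == E(lambda x: f(x) ** n) * E(lambda x: g(x) ** m)
        for n in range(order + 1) for m in range(order + 1)
    )

print(independent_up_to(4))
```

Exact rational arithmetic is used so that the equalities in Definition 1.2 are tested without rounding error. By contrast, $f$ is not independent of itself: $E(f^2) = 7$ while $E(f)^2 = 25/4$.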




1.4. Free products of non-commutative probability spaces.

There is "more room" in the non-commutative universe to accommodate
a different way of combining two non-commutative probability
spaces: free products. Just like the notion of a tensor
product can be used to recover the notion of independence, free
products have led Voiculescu to discover the notion of free
independence.

1.4.1. Free products of groups. We start with a motivating example.
Let $\Gamma_1$ and $\Gamma_2$ be two discrete groups. View the group
algebra of the free product $\mathbb{C}(\Gamma_1 * \Gamma_2)$ as a
non-commutative probability space by letting $\phi$ be the group trace;
for $g \in \Gamma_1 * \Gamma_2$, $\phi(g) = 0$ unless $g = 1$.

Let us understand the relative positions of $\mathbb{C}\Gamma_1$ and
$\mathbb{C}\Gamma_2$ inside of the group algebra of the free product
$\mathbb{C}(\Gamma_1 * \Gamma_2)$. Let $g \in \Gamma_1 * \Gamma_2$ be a
word. Thus $g = g_1 \cdots g_n$ with $g_j \in \Gamma_{i(j)}$. We may carry
out multiplications and cancellations until we reduce the word so that
consecutive letters lie in different groups; i.e., $i(1) \ne i(2)$,
$i(2) \ne i(3)$ and so on. The resulting word is non-trivial if all
$g_1, \dots, g_n$ are non-trivial. Thus:
\[ \phi(g_1 \cdots g_n) = 0 \]
provided that $g_j \in \Gamma_{i(j)}$, $i(1) \ne i(2)$, $i(2) \ne i(3)$,
..., and $\phi(g_1) = \phi(g_2) = \dots = 0$.
By linearity we get:

Proposition 1.3. If $a \in \mathbb{C}(\Gamma_1 * \Gamma_2)$ has the form
\[ a = a_1 \cdots a_n \]
with $a_j \in \mathbb{C}\Gamma_{i(j)}$, $i(1) \ne i(2)$, $i(2) \ne i(3)$,
..., and $\phi(a_1) = \phi(a_2) = \dots = 0$, then
\[ \phi(a) = 0. \]

We now note that this proposition allows one to compute $\phi$ on
$\mathbb{C}(\Gamma_1 * \Gamma_2) = \mathbb{C}\Gamma_1 * \mathbb{C}\Gamma_2$
in terms of its restriction to $\mathbb{C}\Gamma_1$ and
$\mathbb{C}\Gamma_2$. Indeed, an arbitrary element of
$\mathbb{C}\Gamma_1 * \mathbb{C}\Gamma_2$ is a linear combination
of 1 and of terms of the form
\[ a_1 \cdots a_n, \quad a_j \in \mathbb{C}\Gamma_{i(j)}, \quad i(1) \ne i(2),\ i(2) \ne i(3), \dots. \]
But then the equation
\[ 0 = \phi\big((a_1 - \phi(a_1))(a_2 - \phi(a_2)) \cdots (a_n - \phi(a_n))\big) \]
allows one to express $\phi(a_1 \cdots a_n)$ in terms of values of $\phi$
on shorter words. By induction, this allows one to express $\phi$ in
terms of $\phi|_{\mathbb{C}\Gamma_1}$ and $\phi|_{\mathbb{C}\Gamma_2}$.
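Proposition 1.3 can be tested on a concrete free product. The sketch below assumes $\Gamma_1 = \Gamma_2 = \mathbb{Z}$, so that $\Gamma_1 * \Gamma_2$ is the free group on two generators, here called `u` and `v`. The representation of reduced words as tuples and all function names are choices made for this illustration.

```python
from collections import defaultdict
from fractions import Fraction

# A reduced word is a tuple of (letter, power) pairs with nonzero powers and
# alternating letters; the empty tuple is the identity of the group.
def multiply_words(w1, w2):
    w1, w2 = list(w1), list(w2)
    # Cancel or merge at the junction until neighbouring letters differ.
    while w1 and w2 and w1[-1][0] == w2[0][0]:
        letter, power = w1[-1][0], w1[-1][1] + w2[0][1]
        w1.pop()
        w2.pop(0)
        if power != 0:
            w1.append((letter, power))
            break
    return tuple(w1 + w2)

# A group algebra element is a dict {reduced word: coefficient}.
def multiply(a, b):
    out = defaultdict(Fraction)
    for w1, c1 in a.items():
        for w2, c2 in b.items():
            out[multiply_words(w1, w2)] += c1 * c2
    return {w: c for w, c in out.items() if c != 0}

def trace(a):
    """Group trace: the coefficient of the identity (the empty word)."""
    return a.get((), Fraction(0))

def prod(*factors):
    result = {(): Fraction(1)}
    for x in factors:
        result = multiply(result, x)
    return result

U, V = "u", "v"
a1 = {((U, 1),): Fraction(1), ((U, -1),): Fraction(1)}  # u + u^{-1}, trace 0
a2 = {((V, 1),): Fraction(1), ((V, 2),): Fraction(2)}   # v + 2 v^2, trace 0
print(trace(prod(a1, a2, a1, a2)))  # alternating product of centered elements

b1 = {(): Fraction(1), ((U, 1),): Fraction(1)}          # 1 + u, trace 1
b2 = {(): Fraction(3), ((V, -1),): Fraction(1)}         # 3 + v^{-1}, trace 3
print(trace(multiply(b1, b2)) == trace(b1) * trace(b2))
```

The first printed value is 0, as the proposition predicts. The second check illustrates the simplest consequence of the centering trick: for $b_1 \in \mathbb{C}\Gamma_1$ and $b_2 \in \mathbb{C}\Gamma_2$ one gets $\phi(b_1 b_2) = \phi(b_1)\phi(b_2)$.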






1.4.2. Free products of algebras. Such an expression is universal
and works in any free product of two algebras (not necessarily of
group algebras). We thus say:

Definition 1.4. [Voi85] Let $(A_1,\phi_1)$ and $(A_2,\phi_2)$ be two
non-commutative probability spaces. We call the unique linear
functional $\phi$ on $A_1 * A_2$ which satisfies
\[ \phi(a_1 \cdots a_n) = 0, \quad a_j \in A_{i(j)}, \quad i(1) \ne i(2),\ i(2) \ne i(3), \dots, \quad \phi_{i(j)}(a_j) = 0 \ \ \forall j \]
the free product of $\phi_1$ and $\phi_2$. It is denoted $\phi_1 * \phi_2$.

One can check that the free product of two positive linear
functionals is positive (to do so it is easiest to make sense of the
product of the underlying GNS representations). Thus one can
talk about (reduced) free products of C*-algebras or von Neumann
algebras by passing to the appropriate closure in the GNS
representation associated to the free product functional.

1.4.3. Free independence. By analogy with the relationship be- 
tween classical independence and tensor products, Voiculescu 
gave the following definition: 

Definition 1.5. [Voi85] Let $F_1, F_2 \subset (A, \phi)$ be two families of
non-commutative random variables. We say that $F_1$ and $F_2$ are freely
independent, if
\[ \phi(a_1 \cdots a_n) = 0 \]
whenever $a_j \in \mathrm{Alg}(1, F_{i(j)})$, $i(1) \ne i(2)$,
$i(2) \ne i(3)$, ..., and $\phi(a_1) = \phi(a_2) = \dots = 0$.

Here $\mathrm{Alg}(S)$ denotes the algebra generated by a set $S$.
We should point out a certain similarity between this definition
and classical independence, where the requirement was that
\[ E(FG) = 0 \quad \text{if} \quad E(F) = E(G) = 0. \]

1.5. Free Fock space. We give an example of freely indepen- 
dent random variables that does not come from groups. 

1.5.1. Free Fock space. Let $H$ be a Hilbert space, let $\Omega$ be a
vector, and let
\[ F(H) = \mathbb{C}\Omega \oplus H \oplus H \otimes H \oplus H \otimes H \otimes H \oplus \cdots \]
be the Hilbert space direct sum of the tensor powers of $H$ (the
one-dimensional space $\mathbb{C}\Omega$ is thought of as the zeroth
tensor power of $H$). This space is called the free (or full) Fock
space, by analogy with the symmetric and anti-symmetric Fock spaces
(where the symmetric or anti-symmetric tensor product is used instead).

1.5.2. Free creation operators. For $h \in H$ consider the left
creation operator
\[ \ell(h) : F(H) \to F(H) \]
given by
\[ \ell(h)\, h_1 \otimes \cdots \otimes h_n = h \otimes h_1 \otimes \cdots \otimes h_n \]
(here $h \otimes \Omega = h$ by convention). Then $\ell(h)^*$ exists and is
given by
\[ \ell(h)^*\, h_1 \otimes \cdots \otimes h_n = \langle h, h_1 \rangle\, h_2 \otimes \cdots \otimes h_n \]
and $\ell(h)^*\Omega = 0$. The operator $\ell(h)^*$ is also called the
annihilation operator.
These operators satisfy
\[ \ell(h)^* \ell(g) = \langle h, g \rangle 1. \]
In particular, the map
\[ h \mapsto \ell(h) \]
is a linear isometry between $H$ (with its Hilbert space norm) and
the closed linear span of $\{\ell(h) : h \in H\}$, taken with the operator
norm.

1.5.3. Relation with non-crossing diagrams. Let $h_1, \dots, h_n \in H$ be
an orthonormal family. Let $\ell_j = \ell(h_j)$. Thus
$\ell_i^* \ell_j = \delta_{ij} 1$.

The joint distribution of the family
$\{\ell_1, \ell_1^*, \dots, \ell_n, \ell_n^*\}$ (also known as the
*-distribution of $\{\ell_1, \dots, \ell_n\}$) has a nice combinatorial
description.

Suppose that we are interested in
\[ \phi\big(\ell_{i(1)}^{g(1)} \cdots \ell_{i(k)}^{g(k)}\big), \]
where $i(j) \in \{1, \dots, n\}$ and $g(j) \in \{\cdot, *\}$,
$j = 1, \dots, k$ (by $\ell^g$ we mean $\ell^*$ if $g = *$ and $\ell$ if
$g = \cdot$).

Mark $k$ points on the x-axis in the half-plane $\{(x,y) : y \ge 0\}$ at
positions $(1,0), \dots, (k,0)$, and color them by $n$ colors, so that the
$j$-th point is colored with the $i(j)$-th color. Attach to the $j$-th
point the line segment from $(j,0)$ to $(j,1)$. Orient this segment
upwards (towards infinity) if $g(j) = \cdot$ and orient it downwards
(toward the x-axis) if $g(j) = *$. Color the segment the same way
as the $j$-th point, from which it is drawn.

Then there exists at most one way of drawing a diagram so that:

o The upper end of every segment is connected to the upper
end of exactly one other segment, and all segments connected
together have the same color;

o Orient each line connecting two segments counter-clockwise.
Then the orientation of the line is compatible
with the orientation of the segments;

o The lines do not cross.

It is not hard to prove that
$\phi\big(\ell_{i(1)}^{g(1)} \cdots \ell_{i(k)}^{g(k)}\big) = 1$ iff such a
diagram exists, while
$\phi\big(\ell_{i(1)}^{g(1)} \cdots \ell_{i(k)}^{g(k)}\big) = 0$ otherwise.

1.5.4. Moments of $\ell_1 + \ell_1^*$. Utilizing this description one can
prove, for example, that
\[ \phi\big((\ell_1 + \ell_1^*)^k\big) = C_k, \]
where $C_k$ is the number of non-crossing pairings of the integers
$\{1, \dots, k\}$. Recall that a pairing of $\{1, \dots, k\}$ is an
equivalence relation on this set, so that each equivalence class has
exactly two elements. Non-crossing pairings are ones for which one can
draw lines above the real axis $\mathbb{R} \supset \{1, \dots, k\}$,
connecting the two elements of each equivalence class of the pairing,
and having no intersections (more generally, one can in a similar way
define non-crossing partitions of the set $\{1, \dots, k\}$). We shall
later see that the moments of $X = \ell_1 + \ell_1^*$ are related to the
semicircle law.
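The moment formula can be checked by letting the operators act on basis vectors of the Fock space. The sketch below assumes $H = \mathbb{C}$ with unit vector $e_1$, and stores a vector as a dictionary from words to coefficients; the names `create`, `annihilate` and `vacuum_moment` are hypothetical.

```python
from collections import defaultdict

# A vector in the free Fock space is stored as {word: coefficient}, where the word
# (i_1, ..., i_m) stands for e_{i_1} (x) ... (x) e_{i_m} and the empty word () is Omega.
def create(i, vec):
    """Left creation operator l_i: prepend the letter i."""
    return {(i,) + w: c for w, c in vec.items()}

def annihilate(i, vec):
    """Adjoint l_i^*: strip a leading letter i; kills Omega and words not starting with i."""
    return {w[1:]: c for w, c in vec.items() if w and w[0] == i}

def add(u, v):
    out = defaultdict(int)
    for vec in (u, v):
        for w, c in vec.items():
            out[w] += c
    return {w: c for w, c in out.items() if c != 0}

def vacuum_moment(k):
    """phi((l_1 + l_1^*)^k) = <Omega, (l_1 + l_1^*)^k Omega>."""
    vec = {(): 1}
    for _ in range(k):
        vec = add(create(1, vec), annihilate(1, vec))
    return vec.get((), 0)

print([vacuum_moment(k) for k in range(9)])  # [1, 0, 1, 0, 2, 0, 5, 0, 14]
```

The odd moments vanish and the even ones are 1, 2, 5, 14, the Catalan numbers, which count the non-crossing pairings of 2, 4, 6 and 8 points.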

Non-crossing diagrams and non-crossing partitions have a very 
deep connection with free probability; this connection is beyond 
the scope of these notes (see e.g. [Spe98]). We will point out later,
however, how this connection explains the relationship between 
freeness and large random matrices. 

1.5.5. Free independence. Let $A = C^*(\ell(h) : h \in H)$ and let
$\phi : A \to \mathbb{C}$ be given by
\[ \phi(a) = \langle \Omega, a\Omega \rangle. \]
The C*-algebra $A$ is an extension of the Cuntz algebra $O_n$,
$n = \dim H$, if $n < \infty$, and is isomorphic to $O_\infty$ if
$\dim H = \infty$.

It is not hard to prove that if $H_1 \perp H_2$ are two subspaces of $H$,
then the algebras
\[ C^*(\ell(h) : h \in H_1) \quad \text{and} \quad C^*(\ell(h) : h \in H_2) \]
are freely independent in $(A, \phi)$.
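The freeness condition can be verified in a small case. The sketch below assumes $H = \mathbb{C}^2$ with orthonormal basis $e_1, e_2$, takes $H_j = \mathbb{C}e_j$, and tests the vanishing of $\phi$ on alternating products of the centered elements $x_j^2 - 1$, where $x_j = \ell_j + \ell_j^*$. All function names are illustrative.

```python
from collections import defaultdict

# Fock-space vectors are dicts {word: coefficient}; () is the vacuum Omega.
def create(i, vec):
    return {(i,) + w: c for w, c in vec.items()}

def annihilate(i, vec):
    return {w[1:]: c for w, c in vec.items() if w and w[0] == i}

def combine(*terms):
    """Linear combination sum(c * vec) of vectors given as (c, vec) pairs."""
    out = defaultdict(int)
    for c, vec in terms:
        for w, coeff in vec.items():
            out[w] += c * coeff
    return {w: coeff for w, coeff in out.items() if coeff != 0}

def x(i, vec):
    """x_i = l_i + l_i^*, an element of C*(l(e_i))."""
    return combine((1, create(i, vec)), (1, annihilate(i, vec)))

def centered_square(i, vec):
    """(x_i^2 - 1) applied to vec; since phi(x_i^2) = 1 this element has phi = 0."""
    return combine((1, x(i, x(i, vec))), (-1, vec))

def phi(ops):
    """Vacuum expectation <Omega, a_1 a_2 ... a_n Omega> for a list of operators."""
    vec = {(): 1}
    for op in reversed(ops):  # the rightmost operator acts first
        vec = op(vec)
    return vec.get((), 0)

a = lambda v: centered_square(1, v)  # lies in the algebra generated by l_1
b = lambda v: centered_square(2, v)  # lies in the algebra generated by l_2
print(phi([a]), phi([b]), phi([a, b]), phi([a, b, a, b]))
```

All four printed values are 0. The alternation matters: `phi([a, a])` equals 1, since both factors then come from the same algebra.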

1.6. Free Central Limit Theorem. 






1.6.1. Convergence in moments. We say that a sequence $X_n$ of
random variables converges in moments to the law of a random
variable $X$, if $\mu_{X_n} \to \mu_X$ in moments; that is to say, for
any $p \ge 0$,
\[ E(X_n^p) \to E(X^p). \]
This definition makes sense verbatim (with the replacement of $E$
by $\phi$) in the setting of a non-commutative probability space.

1.6.2. Classical CLT. Let $X_1, \dots, X_n, \dots$ be independent random
variables, so that for all $j$, $E(X_j) = 0$, $E(X_j^2) = 1$, and so that
for any $p \ge 0$, $\sup_n |E(X_n^p)| < C_p$ for some constants
$C_p < \infty$. The classical central limit theorem states:

Theorem 1.6. Let
\[ Y_n = \frac{1}{\sqrt{n}} (X_1 + \cdots + X_n). \]
Then the laws of the random variables $Y_n$ converge in moments to
the Gaussian law $\mu_{\mathrm{Gauss}}$ given by
\[ d\mu_{\mathrm{Gauss}}(t) = \frac{1}{\sqrt{2\pi}} \exp(-t^2/2)\, dt. \]

The main tool used in the proof of this theorem is the fact that
if $Z_1$ and $Z_2$ are independent random variables, then the law of
their sum is given by a convolution formula:
\[ \mu_{Z_1 + Z_2} = \mu_{Z_1} * \mu_{Z_2}. \]
One then utilizes the fact that the Fourier transform $\mathcal{F}$
satisfies
\[ \mathcal{F}(\mu_1 * \mu_2) = \mathcal{F}(\mu_1)\, \mathcal{F}(\mu_2). \]
Thus if we write $L_\mu = \log \mathcal{F}(\mu)$, then
\[ L_{\mu_1 * \mu_2} = L_{\mu_1} + L_{\mu_2}. \]
Using this one can compute $L_{\mu_{Y_n}}$ and argue that, as
$n \to \infty$, it becomes quadratic in $t$. This implies that
$\mu_{Y_n}$ converge in moments to a measure whose Fourier transform is
proportional to $\exp(-t^2/2)$, so that
$\mu_{Y_n} \to \mu_{\mathrm{Gauss}}$.
1.6.3. Free CLT. Amazingly, the statement of the free central
limit theorem is essentially the same as that of the classical one.
The only difference is the replacement of the requirement of
independence by that of free independence. This is only a single
example of a surprising number of parallels between the behavior
of independent and freely independent random variables.

Let $X_1, \dots, X_n, \dots$ be freely independent random variables, so
that for all $j$, $\phi(X_j) = 0$, $\phi(X_j^2) = 1$, and so that for any
$p \ge 0$, $\sup_n |\phi(X_n^p)| < C_p$ for some constants $C_p < \infty$.
The free central limit theorem then states:

Theorem 1.7. [Voi85] Let
\[ Y_n = \frac{1}{\sqrt{n}} (X_1 + \cdots + X_n). \]
Then the laws of the random variables $Y_n$ converge in moments to
the semicircle law $\mu_{\mathrm{semicirc}}$ given by
\[ d\mu_{\mathrm{semicirc}}(t) = \frac{1}{2\pi} \sqrt{4 - t^2}\, dt, \quad t \in [-2, 2]. \]

We will postpone the proof of this theorem until we get to talk
about the R-transform. For now let us just note that we need a
tool to compute the distribution of $Z_1 + Z_2$ in terms of the
distributions of $Z_1$ and $Z_2$ if $Z_1$ and $Z_2$ are freely
independent.

1.7. Free Harmonic Analysis. The corresponding classical
problem involved computing the convolution of two measures
via the Fourier transform.

1.7.1. Free additive convolution. By analogy with the classical 
situation, Voiculescu gave the following definition: 

Definition 1.8. [Voi85] Let $\mu_1$ and $\mu_2$ be two probability
measures on $\mathbb{R}$. We define their free additive convolution
$\mu_1 \boxplus \mu_2$ to be the law of the random variable $Z_1 + Z_2$,
where $Z_1$ and $Z_2$ are freely independent, and $\mu_{Z_j} = \mu_j$,
$j = 1, 2$.

Since $Z_1$, $Z_2$ are free in $(A, \phi)$, the freeness condition
determines the restriction of $\phi$ to $\mathrm{Alg}(Z_1, Z_2)$ in terms
of the restrictions of $\phi$ to $\mathrm{Alg}(Z_j)$, $j = 1, 2$. Thus the
joint distribution of $Z_1$ and $Z_2$ depends only on $\mu_{Z_1}$ and
$\mu_{Z_2}$. Thus the distribution of $Z_1 + Z_2$ (which depends only on
the joint distribution of $Z_1$ and $Z_2$) depends only on
$\mu_{Z_1} = \mu_1$ and $\mu_{Z_2} = \mu_2$. It follows that
$\mu_1 \boxplus \mu_2$ is well-defined.

Note that $\boxplus$ is an operation on the space of probability
measures on $\mathbb{R}$.

Example 1.9. Let $\mu$ be a probability measure and let $\delta_x$ be the
point mass at $x$. Then $\mu \boxplus \delta_x = \mu_x$, the translate of
$\mu$ by $x$. In particular, $\mu \boxplus \delta_x$ is the same as the
classical convolution $\mu * \delta_x$.

1.7.2. R-transform. There is a free analog of the logarithm of the
Fourier transform, which linearizes free additive convolution.
Let $\mu$ be a probability measure on $\mathbb{R}$, and let
\[ G_\mu(\zeta) = \int \frac{d\mu(t)}{\zeta - t}, \quad \Im \zeta > 0 \]
be a function defined in the upper half-plane. This function is
sometimes called the Cauchy transform of $\mu$.
If $\mu$ has moments of all orders (e.g., if it is compactly supported),
$G_\mu$ is a power series in $1/\zeta$ and we have
\[ G_\mu(\zeta) = \sum_{p \ge 0} \frac{m_p}{\zeta^{p+1}}, \]
where
\[ m_p = \int t^p\, d\mu(t) \]
are the moments of $\mu$. Thus $G_\mu$ is the generating function for the
moments of $\mu$.
Define $R_\mu(z)$ by the equation
\[ G_\mu\left( \frac{1}{z} + R_\mu(z) \right) = z. \]
It turns out that $R_\mu(z)$ is analytic in a certain region in
$\mathbb{C}$; however, one can simply understand it as a formal power
series in $z$ and regard the equation above as an equation involving
composition of formal power series.

Voiculescu proved the following linearization theorem, which
shows that the map $\mu \mapsto R_\mu$ is a free analog of the logarithm
of the Fourier transform.

Theorem 1.10. [Voi85] Let $R_\mu(z) = \sum_{n \ge 0} \alpha_{n+1} z^n$ be
the R-transform of $\mu$. Then:

(a) $\alpha_n$ is a universal polynomial expression in the first $n$
moments of $\mu$;

(b) $R_\mu(z) = z$ if and only if $\mu = \mu_{\mathrm{semicirc}}$, i.e.,
$d\mu(t) = \frac{1}{2\pi} \sqrt{4 - t^2}\, dt$;

(c) $R_{\mu_1 \boxplus \mu_2}(z) = R_{\mu_1}(z) + R_{\mu_2}(z)$;

(d) If $Y$ has law $\mu$ and $\lambda \in \mathbb{R}$, then
$R_{\mu_{\lambda Y}}(z) = \lambda R_\mu(\lambda z)$.
1.7.3. Proof of additivity of R-transform. We will sketch a proof
of (a), (b) and (c). We start with a Lemma.

Lemma 1.11. Let $X \in (M, \psi)$ be a non-commutative random variable.
Fix $h \in \mathbb{C}$, $\|h\| = 1$, and let $\ell = \ell(h)$. For a
sequence of numbers $a_1, a_2, \dots$, let
\[ Y_N = \ell^* + \sum_{k=0}^{N} a_{k+1} \ell^k \]
acting on the full Fock space $F(\mathbb{C})$. Then there exists a unique
sequence of numbers $a_1, a_2, \dots$, so that for each $N$,
\[ \psi(X^j) = \phi(Y_N^j), \quad \forall\, 0 \le j \le N + 1. \]
Moreover, each $a_{k+1}$ is a polynomial in
$\{\psi(X^j) : 0 \le j \le k + 1\}$, and this polynomial is universal,
and does not depend on $X$.

The proof is based on an inductive argument and the combinatorial
formula for moments of free creation operators.

1.7.4. Combinatorial definition of R-transform. Given $X$ with law
$\mu$, let $a_1, a_2, \dots$ be as in the Lemma above. Consider the formal
power series
\[ R_\mu(z) = \sum_{n \ge 0} a_{n+1} z^n. \]
For now we'll consider $R_\mu$ given by this new definition, and call it
the "combinatorial R-transform". We shall later prove that $R_\mu(z)$
satisfies our old analytic definition in terms of $G_\mu$ given above; in
particular, it will follow that $\alpha_n = a_n$.

1.7.5. Additivity of combinatorial R-transform.

Proposition 1.12. $R_{\mu_1 \boxplus \mu_2} = R_{\mu_1} + R_{\mu_2}$.

Proof. Let $\ell_1, \ell_2$ be two free creation operators on the free
Fock space $F(\mathbb{C}^2)$, associated to a pair of orthonormal vectors.
Given $\mu_1$ and $\mu_2$, let
\[ Y_1(n) = \ell_1^* + \sum_{k<n} a_{k+1} \ell_1^k, \qquad Y_2(n) = \ell_2^* + \sum_{k<n} a'_{k+1} \ell_2^k \]
be random variables in $C^*(\ell_1)$, $C^*(\ell_2)$, respectively, so that
their first $n$ moments are the same as the first $n$ moments of $\mu_1$
and $\mu_2$, respectively.

Since $C^*(\ell_1)$ and $C^*(\ell_2)$ are freely independent, $Y_1(n)$ and
$Y_2(n)$ are freely independent. Since moments of order up to $n$ of
$Y_1(n) + Y_2(n)$ depend only on the moments of order up to $n$ of
$Y_1(n)$ and $Y_2(n)$, we see that the moments of order up to $n$ of
$\mu_1 \boxplus \mu_2$ and $Y_1(n) + Y_2(n)$ are the same.

We leave to the reader the combinatorial exercise to check that
the moments of $Y_1(n) + Y_2(n)$ are the same as the moments of
\[ Y_3(n) = \ell_1^* + \sum_{k<n} (a_{k+1} + a'_{k+1}) \ell_1^k. \]
By the uniqueness statement in Lemma 1.11, it follows that
$R_{\mu_1 \boxplus \mu_2} = R_{\mu_1} + R_{\mu_2}$ as claimed. □

1.7.6. Analytic and combinatorial R-transforms are the same. It
now remains to prove that the combinatorial R-transform
$R_\mu(z) = \sum a_{n+1} z^n$ satisfies the formula relating it to the
Cauchy transform $G_\mu$ (and so $\alpha_n = a_n$). The proof of the
following proposition is due to Haagerup [Haa97].

Proposition 1.13. Let
$K_\mu(z) = \frac{1}{z} + R_\mu(z) = z^{-1} + \sum_{k \ge 0} a_{k+1} z^k$.
With the above notation, one has
\[ G_\mu(K_\mu(z)) = z \]
and
\[ K_\mu(G_\mu(\zeta)) = \zeta, \]
both equalities interpreted in terms of composition of formal power
series.

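Proof. Let $\ell$ be a free creation operator corresponding to a unit
vector $e$, and acting on the full Fock space $F(\mathbb{C})$. Let
$x = \ell^* + f(\ell)$, where $f$ is a polynomial with real coefficients,
and let $\mu$ be the law of $x$. Thus by definition,
\[ R_\mu(z) = f(z). \]
For $z \in \mathbb{C}$ with $|z| < 1$, consider the vector
\[ \omega_z = \Omega + \sum_{n=1}^{\infty} z^n e^{\otimes n}. \]
Then
\[ \ell \omega_z = \sum_{n=0}^{\infty} z^n e^{\otimes (n+1)} = \frac{1}{z} (\omega_z - \Omega), \quad 0 < |z| < 1. \]
Similarly,
\[ \ell^* \omega_z = \sum_{n=1}^{\infty} z^n e^{\otimes (n-1)} = z \omega_z. \]
Thus $\omega_z$ is an eigenvector for $\ell^*$ with eigenvalue $z$. Hence
\[ x^* \omega_z = (\ell + f(\ell^*)) \omega_z = \ell \omega_z + f(z) \omega_z = \frac{1}{z} (\omega_z - \Omega) + f(z) \omega_z = \left( \frac{1}{z} + f(z) \right) \omega_z - \frac{1}{z} \Omega, \quad 0 < |z| < 1. \]
It follows that
\[ \left( \left( \frac{1}{z} + f(z) \right) 1 - x^* \right) \omega_z = \frac{1}{z} \Omega. \]
Now choose $0 < \delta < 1$, so that
$\left( \frac{1}{z} + f(z) \right) 1 - x^*$ is invertible for
$0 < |z| < \delta$. This is possible, since
$\lim_{z \to 0} \left| \frac{1}{z} + f(z) \right| = \infty$ (since $f(z)$
is a polynomial). Hence
\[ \left( \left( \frac{1}{z} + f(z) \right) 1 - x^* \right)^{-1} \Omega = z \omega_z; \]
thus
\[ \phi\left( \left( \left( \frac{1}{z} + f(z) \right) 1 - x^* \right)^{-1} \right) = \left\langle \Omega, \left( \left( \frac{1}{z} + f(z) \right) 1 - x^* \right)^{-1} \Omega \right\rangle = z \langle \Omega, \omega_z \rangle = z. \]
By definition of $G_\mu$,
\[ G_\mu(\lambda) = \phi\big( (\lambda 1 - x)^{-1} \big). \]
Since all of the coefficients of the power series $G_\mu(\lambda)$ are
real, we get that
\[ G_\mu(\lambda) = \overline{G_\mu(\bar{\lambda})}, \]
so that
\[ G_\mu(\lambda) = \phi\big( (\lambda 1 - x^*)^{-1} \big). \]
We now substitute $\lambda = \frac{1}{z} + f(z)$ to get
\[ G_\mu\left( \frac{1}{z} + f(z) \right) = z. \]
We also see that $G_\mu$ is invertible with respect to composition on
some neighborhood. Applying its inverse to both sides, and
remembering that $f(z) = R_\mu(z)$, we get that
\[ K_\mu(z) = R_\mu(z) + \frac{1}{z} = G_\mu^{-1}(z) \]
as claimed.

This concludes the proof in the case that $R_\mu$ is a polynomial;
the general statement can be deduced from this special case by
taking limits. □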

We have thus proved (a) and (c) of Theorem 1.10.

1.7.7. Semicircular variables. Let us prove (b). Assume that
$R_\mu(z) = z$. Then $K_\mu(z) = \frac{1}{z} + z$ and
\[ \frac{1}{G_\mu(\zeta)} + G_\mu(\zeta) = \zeta. \]
Solving this gives
\[ G_\mu(\zeta) = \frac{\zeta - \sqrt{\zeta^2 - 4}}{2}. \]
One can recover $\mu$ from $G_\mu$ by the formula
\[ d\mu(t) = -\frac{1}{\pi} \lim_{s \downarrow 0} \Im\, G_\mu(t + is)\, dt. \]
Since $G_\mu(\zeta) \to 0$ as $\zeta \to \infty$ (as is apparent from the
integral formula for the Cauchy transform), the branch of the square
root must be chosen so that $\sqrt{\zeta^2 - 4} > 0$ for $\zeta$ real and
large. It follows that
\[ d\mu(t) = \frac{1}{2\pi} \sqrt{4 - t^2}\, dt, \quad t \in [-2, 2], \]
and $d\mu(t) = 0$ outside of this interval.

Note that if $R_\mu(z) = z$, then $\alpha_j = a_j = 0$ unless $j = 2$. It
follows that the variable $\ell_1 + \ell_1^*$ on the Fock space
$F(\mathbb{C})$ has semicircular distribution.
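The inversion step can be checked numerically. The sketch below selects the branch of the square root by requiring $\Im\, G_\mu(\zeta) < 0$ for $\Im\, \zeta > 0$, a property of Cauchy transforms of probability measures that is used here as an assumption in place of the asymptotic argument above; the function names are hypothetical.

```python
import cmath
import math

def cauchy_semicircle(zeta):
    """Root of G^2 - zeta*G + 1 = 0 with Im G < 0, for zeta in the upper half-plane."""
    root = cmath.sqrt(zeta * zeta - 4)
    candidates = [(zeta - root) / 2, (zeta + root) / 2]
    # The two roots multiply to 1, so exactly one of them has negative imaginary part.
    return min(candidates, key=lambda g: g.imag)

def density_from_cauchy(t, s=1e-6):
    """Approximate Stieltjes inversion: -(1/pi) * Im G(t + i s) for small s > 0."""
    return -cauchy_semicircle(complex(t, s)).imag / math.pi

def semicircle_density(t):
    return math.sqrt(4 - t * t) / (2 * math.pi) if abs(t) <= 2 else 0.0

for t in (-1.5, 0.0, 1.0, 3.0):
    print(t, density_from_cauchy(t), semicircle_density(t))
```

Away from the endpoints $\pm 2$ the two columns agree to several decimal places, and the recovered density is essentially zero outside $[-2, 2]$.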

1.7.8. Proof of free CLT. We are now ready to give a proof of the
free central limit theorem.

Let $X_1, \dots, X_n, \dots$ be freely independent random variables
satisfying the assumptions of the free central limit theorem, and let
\[ Z_n = X_1 + \cdots + X_n, \qquad Y_n = \frac{1}{\sqrt{n}} (X_1 + \cdots + X_n) = \frac{1}{\sqrt{n}} Z_n. \]
Let $\nu_n$ be the law of $X_n$, let $\mu_n$ be the law of $Y_n$ and let
$\lambda_n$ be the law of $Z_n$. Thus
\[ \lambda_n = \nu_1 \boxplus \cdots \boxplus \nu_n \]
and because of additivity of R-transform,
\[ R_{\lambda_n}(z) = R_{\nu_1}(z) + \cdots + R_{\nu_n}(z). \]
Write $R_{\lambda_n}(z) = \sum_p \alpha^{(n)}_{p+1} z^p$. Since the
coefficient of $z^p$ in $R_{\nu_j}(z)$ is a universal polynomial in the
moments up to order $p + 1$ of $X_j$, and
$\sup_j |\phi(X_j^p)| < \infty$, it follows that
$|\alpha^{(n)}_{p+1}| \le n \cdot K_p$, where $K_p$ are some
constants independent of $n$.

Thus, by part (d) of Theorem 1.10,
\[ R_{\mu_n}(z) = \frac{1}{\sqrt{n}} R_{\lambda_n}\left( \frac{z}{\sqrt{n}} \right) = \sum_p \frac{\alpha^{(n)}_{p+1}}{n^{(p+1)/2}}\, z^p. \]
If $p > 1$ is fixed, the estimate
\[ \left| \frac{\alpha^{(n)}_{p+1}}{n^{(p+1)/2}} \right| \le K_p\, n^{(1-p)/2} \]
implies that
\[ \frac{\alpha^{(n)}_{p+1}}{n^{(p+1)/2}} \to 0. \]
If $p = 0$, the fact that $\phi(X_n) = 0$ so that $\phi(Y_n) = 0$ implies
that $\alpha^{(n)}_1 = 0$ for all $n$. Finally, the fact that
$\phi(X_n^2) = 1$ implies that $\phi(Y_n^2) = 1$ and
$\alpha^{(n)}_2 / n = 1$ for all $n$.
We conclude that
\[ R_{\mu_n}(z) \to z \]
as $n \to \infty$ in the sense of coefficient-wise convergence of formal
power series. Since the $p$-th moment of $\mu_n$ is a universal
polynomial in the first $p$ coefficients of the power series
$R_{\mu_n}(z)$, it follows that the $p$-th moment of $\mu_n$ converges to
the $p$-th moment of the unique measure $\mu$ for which $R_\mu(z) = z$. We
saw above that this implies that $\mu$ is then the semicircle measure,
and so $\mu_n \to \mu_{\mathrm{semicirc}}$.
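The scaling argument can be followed on a concrete example. The sketch below takes the $X_j$ to be free copies of a variable with values $\pm 1$ of probability 1/2 each (an illustrative choice), and again assumes the standard recursion between moments and R-transform coefficients, which is equivalent to the defining equation for $R_\mu$ but is not written out in these notes.

```python
def power_sum(m, s, total):
    """Sum over i_1 + ... + i_s = total (all i_j >= 0) of m[i_1] * ... * m[i_s]."""
    if s == 0:
        return 1.0 if total == 0 else 0.0
    return sum(m[i] * power_sum(m, s - 1, total - i) for i in range(total + 1))

def cumulants_from_moments(moments):
    """[m_1, ..., m_n] -> [alpha_1, ..., alpha_n], the first coefficients of R."""
    m, alpha = [1.0], []
    for n, m_n in enumerate(moments, start=1):
        lower = sum(alpha[s - 1] * power_sum(m, s, n - s) for s in range(1, n))
        alpha.append(m_n - lower)
        m.append(m_n)
    return alpha

def moments_from_cumulants(alpha):
    m = [1.0]
    for n in range(1, len(alpha) + 1):
        m.append(sum(alpha[s - 1] * power_sum(m, s, n - s) for s in range(1, n + 1)))
    return m[1:]

def moments_of_Y(moments_of_X, n):
    """Moments of Y_n = (X_1 + ... + X_n) / sqrt(n) for free copies X_j of X."""
    alpha = cumulants_from_moments(moments_of_X)
    # Additivity multiplies alpha_p by n; rescaling by 1/sqrt(n) multiplies it by n^(-p/2).
    scaled = [n * a * n ** (-p / 2) for p, a in enumerate(alpha, start=1)]
    return moments_from_cumulants(scaled)

coin = [0, 1, 0, 1, 0, 1]  # m_1, ..., m_6 of the law with mass 1/2 at -1 and at 1
for n in (1, 4, 100, 10**6):
    print(n, moments_of_Y(coin, n))
```

For this example the fourth and sixth moments of $Y_n$ are $2 - 1/n$ and $5 - 6/n + 2/n^2$, which tend to the semicircle moments 2 and 5.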



1.8. Further topics. We already briefly touched upon the amazing
correspondence between various theorems in the classical and
free context. There are several other instances of this. For example,
one can consider the free analog of infinite divisibility. A measure
$\mu$ is called infinitely divisible if for any $n$ there is a measure
$\mu_n$ so that $\mu$ is the $n$-fold convolution
$\mu_n * \cdots * \mu_n$. One can say that $\mu$ is freely infinitely
divisible if for each $n$ there is a measure $\mu_n$ so that $\mu$ is the
$n$-fold free convolution $\mu_n \boxplus \cdots \boxplus \mu_n$.
Remarkably, there is a one-to-one correspondence between the classically
infinitely divisible measures and the free ones. A similar situation
occurs when considering stable and freely stable laws.

There is also a notion of multiplicative free convolution, based
on taking products of non-commutative random variables.

The reader is encouraged to consult [Voi00] for more details.

Lecture 2. Random Matrices and Free Probability.

One of the most important advances in free probability the- 
ory was Voiculescu's discovery that free probability theory de- 
scribes the asymptotic distribution of certain large random ma- 
trices. This has led to a number of applications of free probability 
theory, both to spectral computations for random matrices, and 
to von Neumann algebras. The latter applications rely on the 
somewhat unexpected presence of a "matricial" structure in free 
probability theory: if one takes several square arrays of certain 
free random variables and creates several matrices out of these 
arrays, then the resulting matrices have surprising freeness prop- 
erties (for example, the resulting matrices may be freely indepen- 
dent). 

2.1. Random matrices. A random matrix is a matrix whose
entries are random variables. One can also think of a random
matrix as a matrix-valued random variable, i.e., as a randomly
chosen matrix. Any Borel function of a random matrix then becomes
a random variable. For example, the eigenvalues of a random
matrix (being functions of its entries) are themselves random
variables.

2.1.1. Expected distributions. Let $X_N$ be a self-adjoint random
matrix of size $N \times N$. We think of $X_N$ as a function
$X_N : S \to M_N(\mathbb{C})$ on some probability space $(S, \sigma)$.
Integration with respect to $\sigma$ has the meaning of taking the
expected value and will be denoted by $E$.

One is frequently interested in the expected proportion of the
eigenvalues of $X_N$ that lie in a given interval $[a,b]$:
\[ \Lambda_N([a,b]) = \frac{1}{N}\, \text{Expected } \#\{\text{eigenvalues of } X_N \text{ in } [a,b]\} = \frac{1}{N} E\big( \#\{\text{eigenvalues of } X_N(t) \text{ in } [a,b]\} \big). \]
Let $\lambda_1(t), \dots, \lambda_N(t)$ be the eigenvalues of $X_N(t)$,
listed with multiplicity, and viewed as random variables. Let
\[ \nu_N^t = \frac{1}{N} \sum_{j=1}^{N} \delta_{\lambda_j(t)} \]
be a random measure associated with this list of eigenvalues (we
say that $\nu_N^t$ is random to emphasize that it depends on $t$, i.e.,
is a measure-valued random variable). Then
\[ \Lambda_N([a,b]) = E\big( \nu_N^t([a,b]) \big) \]
is the expected value of $\nu_N^t([a,b])$. Thus if we set
\[ \mu_N = E(\nu_N^t), \]
we obtain that
\[ \Lambda_N([a,b]) = \mu_N([a,b]). \]
Note that
\[ \nu_N^t = \frac{1}{N} \mathrm{Tr} \circ \sigma_N^t, \]
where $\sigma_N^t$ is the spectral measure of $X_N(t)$. In other words,
$\nu_N^t$ is the distribution of $X_N(t)$, when viewed as a random
variable in $(M_N(\mathbb{C}), \frac{1}{N} \mathrm{Tr})$. Thus $\mu_N$ is
the "expected value of the distribution" of $X_N$.
2.2. Asymptotics of random matrices. We are mainly interested
in the asymptotics of the expected number of eigenvalues of
a random matrix in a given interval. In other words, we are
interested in studying the asymptotics of the measure $\mu_N$ as
$N \to \infty$.

It should be mentioned that the eigenvalue distributions of random
matrices have been studied in several ways. Instead of looking
at the expected numbers of eigenvalues, there is also interest
in the behavior of eigenvalue spacing (normalized so that the
average spacing is 1). One is also interested in the behavior of the
largest and smallest eigenvalues (this translates into considering
the expected value of the spectral radius, or the operator norm,
of the matrix $X_N$). We have already heard in this conference of
the significant progress recently made by Haagerup and Thorbjørnsen
on the latter problem, in the case that $X_N$ is an arbitrary
polynomial of a $k$-tuple of Gaussian random matrices.

2.2.1. Wigner's theorem for Gaussian random matrices. Let $X_N$
be a self-adjoint random matrix, whose entries are $g_{ij}$,
$1 \le i, j \le N$, determined as follows. The variables
$\{g_{ij} : i \le j\}$ are independent; if $i < j$, then $g_{ij}$ is a
centered complex Gaussian random variable of variance $\frac{1}{N}$; if
$i = j$, then $g_{ij}$ is a centered real Gaussian random variable of
variance $\frac{1}{N}$. Finally, if $i > j$, $g_{ij} = \overline{g_{ji}}$.
One can think of the random matrix $X_N$ as a map
\[ X_N : (S, \sigma) \to M_N(\mathbb{C}). \]
Here $S = M_N(\mathbb{C})$ is the space of complex $N \times N$ matrices,
$X_N$ is the map
\[ A \mapsto \frac{A + A^*}{2}, \]
and $\sigma$ is the Gaussian measure on $S$ given by
\[ d\sigma(A) = \alpha_N\, e^{-\frac{N}{2} \mathrm{Tr}(A^* A)}\, dA, \]
for a suitable constant $\alpha_N$.

Let $\mu_N$ be as before the expected value of the distribution of $X_N$.
Then $\mu_N \to \mu_{\mathrm{semicirc}}$ weakly as $N \to \infty$. This is
a very old result, going back to the work of Wigner in the 1950s [Wig55].

It turns out that the semicircle law is fairly universal for
matrices with independent identically distributed entries. In fact,
Wigner's original work involved matrices $X_N$ whose entries were
not Gaussian, but random signs.
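Wigner's theorem can be observed on a single sample by comparing low moments of the empirical eigenvalue distribution with those of the semicircle law. The sketch below assumes the normalization in which all entries have variance $1/N$; it works with traces of powers rather than eigenvalues so that only the standard library is needed. The function names, the matrix size and the seed are arbitrary choices.

```python
import math
import random

def gue_matrix(N, rng):
    """Sample X_N: off-diagonal entries complex Gaussian with E|g_ij|^2 = 1/N,
    diagonal entries real Gaussian of variance 1/N, and g_ji = conj(g_ij)."""
    X = [[0j] * N for _ in range(N)]
    s = math.sqrt(1 / (2 * N))  # real and imaginary parts each have variance 1/(2N)
    for i in range(N):
        X[i][i] = complex(rng.gauss(0, math.sqrt(1 / N)), 0)
        for j in range(i + 1, N):
            X[i][j] = complex(rng.gauss(0, s), rng.gauss(0, s))
            X[j][i] = X[i][j].conjugate()
    return X

def matmul(A, B):
    Bt = list(zip(*B))  # columns of B
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

def normalized_trace(A):
    return sum(A[i][i] for i in range(len(A))).real / len(A)

def empirical_moments(N, seed=0):
    """Second and fourth moments of the eigenvalue distribution of one sample."""
    rng = random.Random(seed)
    X = gue_matrix(N, rng)
    X2 = matmul(X, X)
    X4 = matmul(X2, X2)
    return normalized_trace(X2), normalized_trace(X4)

print(empirical_moments(60))
```

The two printed numbers should be close to 1 and 2, the second and fourth moments of the semicircle law, with fluctuations of order $1/N$.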

2.2.2. Voiculescu's asymptotic freeness results. The semicircular 
law also arose in free probability theory as the central limit law. 
Voiculescu showed that this is not just a coincidence: families of 
certain N x N random matrices behave as free random variables 
in the large N asymptotics. 

For each $N$, let $D_N$ be a diagonal matrix; assume that the
operator norms $\|D_N\|$ are uniformly bounded in $N$, and assume that
the distribution of $D_N$ (as an element of $(M_N(\mathbb{C}), \frac{1}{N}\mathrm{Tr})$) converges
in moments to a limit measure $\nu$. Let $X_N^{(1)}, \dots, X_N^{(k)}$ be random
matrices described as follows. Let $S = M_N(\mathbb{C})^k$ with the measure
$\sigma$ given by

$$d\sigma(A_1, \dots, A_k) = C_{N,k}\, e^{-\frac{N}{2}\mathrm{Tr}(A_1^*A_1 + \cdots + A_k^*A_k)}\,dA_1 \cdots dA_k,$$

for a suitable constant $C_{N,k}$. Then $X_N^{(p)}$ is the map

$$X_N^{(p)} : (A_1, \dots, A_k) \mapsto \frac{A_p + A_p^*}{2}.$$

More explicitly, if we denote by $g_{ij}^{(p)}$ the $i,j$-th entry of $X_N^{(p)}$, then
$\{g_{ij}^{(p)} : 1 \le i \le j \le N,\ 1 \le p \le k\}$ form a family of independent
centered Gaussian random variables, so that: $g_{ij}^{(p)}$ is a complex
Gaussian of variance $\frac{1}{N}$ if $i < j$; $g_{ii}^{(p)}$ is real Gaussian of variance
$\frac{1}{N}$; and $g_{ij}^{(p)} = \overline{g_{ji}^{(p)}}$ if $i > j$.

The family $(X_N^{(1)}, \dots, X_N^{(k)})$ is sometimes called the Gaussian
Unitary Ensemble (or GUE) because of the obvious invariance of
their joint distribution under conjugation by $k$ unitaries.
Let $\mu_N$ be the distribution of the family $(D_N, X_N^{(1)}, \dots, X_N^{(k)})$,
viewed as a linear functional on the space of polynomials in $k + 1$
indeterminates.

Then Voiculescu proved: 

Theorem 2.1. [Voi91] Let $(d, x_1, \dots, x_k)$ be a family of free random
variables in a non-commutative probability space $(A, \varphi)$, so that $d$
has distribution $\nu$, and $x_1, \dots, x_k$ have semicircular distribution.
Let $\mu$ be the distribution of this family, and let $\mu_N$ be the
distribution of $(D_N, X_N^{(1)}, \dots, X_N^{(k)})$ as described above. Then as $N \to \infty$,
$\mu_N \to \mu$ in moments.

In other words, for any $t$ and any $j_1, \dots, j_t \in \{1, \dots, k\}$,
$n_0, \dots, n_t \in \{0, 1, 2, \dots\}$ one has

$$\lim_{N \to \infty} E\Big(\frac{1}{N}\mathrm{Tr}\big(D_N^{n_0} X_N^{(j_1)} D_N^{n_1} \cdots X_N^{(j_t)} D_N^{n_t}\big)\Big) = \varphi\big(d^{n_0} x_{j_1} d^{n_1} \cdots x_{j_t} d^{n_t}\big).$$

Note that in particular we have that $D_N$ and $X_N^{(1)}, \dots, X_N^{(k)}$ are
asymptotically free. One also recovers Wigner's result, since in
particular $\mu_{X_N^{(1)}} \to \mu_{x_1}$, and $\mu_{x_1}$ is the semicircle law.



2.2.3. Some remarks on the proof. We will not prove this theorem
here; see e.g. [VDN92] for a proof. We shall only sketch
the essential combinatorial trick used in the proof and explain its
connection to non-crossing partitions.

We concentrate on the case of a single random matrix $X_N$ with
Gaussian entries $g_{ij}$ (depending on $N$).

Consider the value of the moment

(2.1)   $$\frac{1}{N}E(\mathrm{Tr}(X_N^k)) = \frac{1}{N}\sum_{i_1, \dots, i_k} E(g_{i_1 i_2} g_{i_2 i_3} \cdots g_{i_{k-1} i_k} g_{i_k i_1}).$$

If $k$ is odd, it is not hard to see that the value of the moment is
zero, so we'll assume that $k$ is even for the remainder of the proof.
Since $g_{ij}$ are Gaussian of variance $\frac{1}{N}$, $E(g_{i_1 i_2} g_{i_2 i_3} \cdots g_{i_{k-1} i_k} g_{i_k i_1})$
is zero unless each variable $g_{i_p i_q}$ entering the product "pairs up"
with another variable $g_{i_{p'} i_{q'}}$ entering the product, with $i_p = i_{q'}$,
$i_q = i_{p'}$ (so that $g_{i_p i_q} = \overline{g_{i_{p'} i_{q'}}}$). That is to say, a term in the sum
(2.1) is zero unless for some pairing $\pi$ of the set $\{1, \dots, k\}$ with
itself, the indices $i_1, \dots, i_k$ satisfy the equations

(2.2)   $$i_s = i_{r+1}, \quad i_{s+1} = i_r \quad \text{if } s \sim_\pi r,\ s \ne r$$

(where $s + 1$ is understood as the remainder mod $k$, and $s \sim_\pi r$ iff
$s$ and $r$ are in the same equivalence class of $\pi$).

Suppose now that we fix $\pi$ and ask how large a contribution we
can get from all of the terms that satisfy (2.2) for this given $\pi$. The
equations (2.2) can be visualized as follows. Let $C_k$ be the cyclic
graph with $k$ edges, numbered 1 through $k$. Place $i_1, \dots, i_k$ on the
vertices of this graph, so that the $j$-th edge, oriented clockwise,
has vertices $i_j$ and $i_{j+1}$, in that order ($j + 1$ is again understood
modulo $k$). In other words, we can think of the map $j \mapsto i_j$ as a
function on the vertices of $C_k$.

The pairing $\pi$ defines an equivalence relation on the set of edges
of $C_k$: edges $r$ and $s$ are equivalent if $r \sim_\pi s$. Form the quotient
graph $C_k/\sim_\pi$ by gluing equivalent edges with orientation
reversed. Then (2.2) is equivalent to saying that the function
$j \mapsto i_j$ descends to a function on the quotient graph $C_k/\sim_\pi$. The
total number of such functions is $N^v$, where $v$ is the number of
vertices of $C_k/\sim_\pi$.

Because the variance of $g_{ij}$ is $E(g_{ij}\overline{g_{ij}}) = \frac{1}{N}$, we can deduce that
the contribution to the sum (2.1) of those terms that satisfy
equations (2.2) for a given $\pi$ is at most

$$\frac{1}{N} \cdot \Big(\frac{1}{N}\Big)^{k/2} \cdot N^{v}.$$

The first factor comes from the normalization of the trace; the
term $(1/N)^{k/2}$ comes from the bound on the variance; and the factor
$N^v$ comes from our estimate of the number of indices $i_1, \dots, i_k$
satisfying (2.2). It follows that the contribution of all of the terms
that satisfy (2.2) for a given $\pi$ is negligible (of order at most $\frac{1}{N}$) if
$v < 1 + \frac{k}{2}$.

Recall that $C_k$ has exactly $k$ edges and that $k$ is even. Thus
$C_k/\sim_\pi$ has exactly $k/2$ edges. Since a connected graph with $k/2$
edges has at most $1 + \frac{k}{2}$ vertices, with equality exactly when it is a
tree, it follows that $C_k/\sim_\pi$ has $1 + \frac{k}{2}$ vertices exactly if it is a
tree. With a little bit of care, one can show that (2.1) is then equal to

$$\frac{1}{N}E(\mathrm{Tr}(X_N^k)) = \#\{\pi : C_k/\sim_\pi \text{ is a tree}\} + O\Big(\frac{1}{N}\Big).$$

On the other hand, we mentioned in §1.5.4 that the $k$-th moment
of a semicircular element is given by

$$\sum_{\sigma \in NC(k)} 1 = \#NC(k),$$

where $NC(k)$ stands for the set of non-crossing pairings of
$\{1, \dots, k\}$. It is not hard to see that if we interpret a pairing $\sigma$
of $\{1, \dots, k\}$ as a pairing of edges of $C_k$, it is non-crossing if and
only if $C_k/\sim_\sigma$ is a tree. This concludes the proof.
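
The combinatorial claim at the end of the argument can be checked by brute force for small $k$. The sketch below is an illustration only, and the function names are ours. It enumerates all pairings of $\{1, \dots, k\}$, glues the paired edges of the cycle $C_k$ with orientation reversed using a union-find structure, and counts the vertices of the quotient graph. Here edge $j$ is taken to run from vertex $j - 1$ to vertex $j$ modulo $k$, which is a labelling convention we assume.

```python
from math import comb


def pairings(points):
    """Yield every pairing of the sorted list `points` as a list of pairs (a, b), a < b."""
    if not points:
        yield []
        return
    first, rest = points[0], points[1:]
    for idx, partner in enumerate(rest):
        remaining = rest[:idx] + rest[idx + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def is_noncrossing(pairing):
    """A pairing is crossing if it contains pairs (a, b), (c, d) with a < c < b < d."""
    return not any(a < c < b < d for a, b in pairing for c, d in pairing)


def quotient_vertex_count(pairing, k):
    """Number of vertices of C_k / ~ after gluing paired edges with reversed orientation."""
    parent = list(range(k))

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for s, r in pairing:
        # Edge j runs from vertex j - 1 to vertex j (mod k).
        parent[find(s - 1)] = find(r % k)   # start of edge s = end of edge r
        parent[find(s % k)] = find(r - 1)   # end of edge s = start of edge r
    return len({find(v) for v in range(k)})


def count_tree_pairings(k):
    """Count pairings whose quotient graph has 1 + k/2 vertices, i.e. is a tree."""
    return sum(
        1 for p in pairings(list(range(1, k + 1)))
        if quotient_vertex_count(p, k) == k // 2 + 1
    )


for k in (2, 4, 6, 8):
    print(k, count_tree_pairings(k), comb(k, k // 2) // (k // 2 + 1))
```

For each even $k$ the two printed counts can be compared directly: the first is the number of pairings whose quotient graph is a tree, the second is the Catalan number that counts non-crossing pairings.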

2.3. An application to random matrix theory. Keeping the
notations of Theorem 2.1, let $Y_N = D_N + X_N^{(1)}$. It is not hard to
work out the limit distribution of $Y_N$ using free probability tools.
Indeed, by Theorem 2.1,

$$\mu_{Y_N} \to \mu_{d + x_1} \quad \text{in moments as } N \to \infty.$$

On the other hand, $d$ and $x_1$ are freely independent. Thus

$$\mu_{d + x_1} = \mu_d \boxplus \mu_{x_1} = \nu \boxplus \mu_{\mathrm{semicirc}}.$$

The computation of the limit distribution of $Y_N$ can then be
carried out using the machinery of the $R$-transform.
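
As a sanity check on this recipe, one can compare low moments numerically in a simple case. The sketch below makes several assumptions of our own: we take $D_N$ with diagonal entries alternating between $+1$ and $-1$, so that $\nu = \frac{1}{2}(\delta_{-1} + \delta_{1})$, and we normalize the Gaussian matrix so that its limit is the semicircle law of variance 1. For free $d$ and $x$ with these laws, expanding $(d + x)^2$ and $(d + x)^4$ and using freeness (mixed alternating moments of centered free variables vanish) gives $\varphi((d+x)^2) = 1 + 1 = 2$ and $\varphi((d+x)^4) = \varphi(d^4) + \varphi(x^4) + 4\varphi(d^2)\varphi(x^2) = 1 + 2 + 4 = 7$.

```python
import numpy as np


def sample_gue(n, rng):
    """Return (A + A*)/2 with A complex Gaussian, normalized so that E|x_ij|^2 = 1/n."""
    scale = np.sqrt(1.0 / n)
    a = rng.normal(scale=scale, size=(n, n)) + 1j * rng.normal(scale=scale, size=(n, n))
    return (a + a.conj().T) / 2


def normalized_moment(y, p):
    """(1/n) Tr(y^p) for a self-adjoint matrix y."""
    eig = np.linalg.eigvalsh(y)
    return float(np.mean(eig ** p))


rng = np.random.default_rng(1)
n = 400
# Deterministic diagonal matrix whose spectral distribution is (delta_{-1} + delta_{1}) / 2.
d = np.diag(np.where(np.arange(n) % 2 == 0, 1.0, -1.0))
y = d + sample_gue(n, rng)

m2 = normalized_moment(y, 2)
m4 = normalized_moment(y, 4)
print(f"second moment {m2:.3f} (free prediction 2)")
print(f"fourth moment {m4:.3f} (free prediction 7)")
```

If $d$ and $x$ were instead classically independent and commuting, the fourth moment would be $1 + 2 + 6 = 9$, so the value near 7 does reflect freeness rather than ordinary independence.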

2.4. Applications to von Neumann algebras. Let us say that
a non-commutative non-self-adjoint random variable $y$ is circular
if $\Re y$ and $\Im y$ are freely independent and are semicircular.

If $X_N^{(1)}$ and $X_N^{(2)}$ are two GUE random matrices, then
$X_N^{(1)} + \sqrt{-1}\,X_N^{(2)}$ converges in $*$-distribution to a circular variable.

If we start with $2n^2$ GUE random matrices $X_N^{(i,j,1)}, X_N^{(i,j,2)}$,
$1 \le i,j \le n$, then we can form a new matrix,

$$Y_N = \big(Y_N^{(i,j)}\big)_{i,j=1}^{n},$$

of size $nN \times nN$, where $Y_N^{(i,j)} = X_N^{(i,j,1)} + \sqrt{-1}\,X_N^{(i,j,2)}$. It is not hard
to see that $\frac{1}{\sqrt{n}}\Re Y_N$, $\frac{1}{\sqrt{n}}\Im Y_N$ is a pair of GUE random matrices. We
thus obtain that $\frac{1}{\sqrt{n}} Y_N$ is circular in the limit $N \to \infty$. From this
it is not hard to prove that if $y_{ij}$, $1 \le i,j \le n$, are a free circular
family, then the matrix

$$y = \frac{1}{\sqrt{n}}\big(y_{ij}\big)_{i,j=1}^{n}$$

is again circular. In fact, one can use the asymptotic freeness
result to show that if we let $D$ be the algebra of scalar diagonal
$n \times n$ matrices, then $D$ is free from $(y, y^*)$.

This fact underlies the earliest applications of free probability
theory to von Neumann algebras and free group factors. For
example, one has the following result of Voiculescu [Voi90]:

Theorem 2.2. Let $n$ be an integer, and let $t \in \mathbb{Q}$ be a rational
number, so that $m = \frac{1}{t^2}(n - 1) + 1$ is an integer. Let $p \in L(\mathbb{F}(n))$ be
a projection in the free group factor $L(\mathbb{F}(n))$ associated to the free
group on $n$ generators. Assume that $p$ has trace $t$. Then

(2.3)   $$pL(\mathbb{F}(n))p \cong L(\mathbb{F}(m)).$$

This theorem has many far-reaching extensions due to Dykema
and Rădulescu, see e.g. [Voi00, Dyk95, Dyk93b, Dyk93a, Dyk94,
Rad92, Rad94]. For example, it turns out that it is possible to
define for each $t \in (1, +\infty]$ a von Neumann algebra $L(\mathbb{F}(t))$, called
an interpolated free group factor, in such a way that $L(\mathbb{F}(t))$ is
the von Neumann algebra of the free group with $t$ generators, if
$t$ is an integer. Moreover, the compression formula (2.3) remains
valid for non-rational traces of $p$: if $p$ has trace $t$, the result is an
interpolated free group factor with $\frac{1}{t^2}(n - 1) + 1$ generators; the same
formula is valid also for non-integer $n$.
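
The arithmetic of the compression formula is simple enough to tabulate. The helper below is a hypothetical illustration (the name `compressed_generators` is ours). It evaluates $m = \frac{1}{t^2}(n - 1) + 1$ exactly with rational arithmetic, for a projection of trace $t \in (0, 1]$ and a parameter $n > 1$ that may be non-integer in the interpolated setting.

```python
from fractions import Fraction


def compressed_generators(n, t):
    """Return m = (n - 1) / t**2 + 1 for the compression p L(F(n)) p, where p has trace t."""
    n = Fraction(n)
    t = Fraction(t)
    if n <= 1:
        raise ValueError("the parameter n must be greater than 1")
    if not 0 < t <= 1:
        raise ValueError("the trace of a projection must lie in (0, 1]")
    return (n - 1) / t ** 2 + 1


# Compressing L(F(2)) by a projection of trace 1/2 gives 5 generators.
print(compressed_generators(2, Fraction(1, 2)))
# Compressing in two steps agrees with compressing once by the product of the traces.
print(compressed_generators(compressed_generators(2, Fraction(1, 2)), Fraction(1, 3)))
print(compressed_generators(2, Fraction(1, 6)))
```

The last two lines illustrate that the formula is compatible with iterated compressions, as it must be, since compressing by $p$ and then by a subprojection is a single compression.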

For a II$_1$ factor $M$, its fundamental group was defined by Murray
and von Neumann to be the set

$$\mathcal{F}(M) = \{\lambda \in (0, +\infty) : M \cong pMp \text{ for } p \in M \text{ a projection of trace } \lambda\}.$$

Rădulescu proved that $\mathcal{F}(L(\mathbb{F}(\infty))) = (0, +\infty)$ (Voiculescu's
result quoted above implied that the positive rational numbers
$\mathbb{Q}_+ \subset \mathcal{F}(L(\mathbb{F}(\infty)))$). In fact, it turns out that there is a dichotomy:
either all interpolated free group factors are isomorphic to each
other (and also to $L(\mathbb{F}(\infty))$), and all have $(0, +\infty)$
as their fundamental groups; or $L(\mathbb{F}(\infty)) \not\cong L(\mathbb{F}(t))$ for finite $t$,
and $\mathcal{F}(L(\mathbb{F}(t))) = \{1\}$ for finite $t$. It is not known which of the two
alternatives holds.

Further developments of these techniques gave information on
fundamental groups of more general free products of von Neumann
algebras and on subfactors of $L(\mathbb{F}(\infty))$ (see e.g. [Rad94,
Dyk95, Shl98, Shl99, PS03, SU02, DR00]).

Lecture 3. Free Entropy via Microstates. 

Free entropy was introduced and developed by Voiculescu in
a series of papers [Voi93, Voi94, Voi96, Voi97, Voi98b, Voi98a,
Voi99a, Voi99b] as a free probability analogue of the classical
information-theoretic entropy; see also Voiculescu's survey
[Voi02].

3.1. Definition of free entropy. Voiculescu's original
"microstates" approach to free entropy followed Boltzmann's definition
of entropy of a macroscopic state.

3.1.1. Microstates and Macrostates. Assume that the macroscopic
behavior of a physical system (e.g. a gas) is described by several
macroscopic parameters (e.g., pressure, volume and temperature).
Then a macrostate $S$ is a state of the system corresponding
to certain prescribed values of these parameters.

Microscopically, the system is made out of a large number of
smaller systems (e.g., the molecules that make up the gas). On
this microscopic level, the system can be described by a microstate
$s$ that specifies exactly the states of all of the sub-systems (e.g., the
exact locations and momenta of all of the molecules of the gas). If
we fix a macrostate $S$, there are many microstates $s$ that lead to
the same macroscopic state.

Boltzmann's formula is then that the entropy of $S$ must be given
by

$$K \log \#\{s : \text{microscopic state } s \text{ leads to macroscopic state } S\}$$

for some constant $K$.

3.1.2. Matricial microstates. Voiculescu's idea is to interpret
$x_1, \dots, x_n \in (A, \varphi)$ as a description of a macroscopic state of a
system, and as microstates to take the set of all matrices $X_1, \dots, X_n$
of a specific dimension that approximate $x_1, \dots, x_n$. More precisely,
for $x_1, \dots, x_n$ in a non-commutative probability space $(A, \varphi)$,
$x_j = x_j^*$, let $M_{k \times k}^{sa}$ be the space of $k \times k$ self-adjoint matrices, and
consider the set

$$\Gamma(x_1, \dots, x_n; k, l, \varepsilon) = \Big\{ (X_1, \dots, X_n) \in (M_{k \times k}^{sa})^n : \text{for any word } w \text{ in } n \text{ letters of length at most } l,$$
$$\Big|\tfrac{1}{k}\mathrm{Tr}(w(X_1, \dots, X_n)) - \varphi(w(x_1, \dots, x_n))\Big| < \varepsilon \Big\}.$$

In other words, we are considering a weak neighborhood $U$ of the
joint law $\mu_{x_1, \dots, x_n}$ defined by the property that $\mu' \in U$ iff the value
of the law $\mu'$ on all words of length at most $l$ deviates by no more
than $\varepsilon$ from that of $\mu_{x_1, \dots, x_n}$. Next, we consider all self-adjoint $k \times k$
matrices $(X_1, \dots, X_n)$ so that $\mu_{X_1, \dots, X_n} \in U$.

The set $\Gamma(x_1, \dots, x_n; k, l, \varepsilon)$ is called the set of (matricial)
microstates for $x_1, \dots, x_n$.

3.1.3. Definition of free entropy. Voiculescu then defined the free
entropy by

$$\chi(x_1, \dots, x_n) = \inf_{l, \varepsilon} \limsup_{k \to \infty} \Big[ \frac{1}{k^2} \log \mathrm{Vol}\, \Gamma(x_1, \dots, x_n; k, l, \varepsilon) + \frac{n}{2} \log k \Big],$$

where Vol refers to the Euclidean volume associated to the standard
identification of $M_{k \times k}^{sa}$ with $\mathbb{R}^{k^2}$. We use the convention that
$\log 0 = -\infty$.

We should note that $\chi$ depends only on the law of $x_1, \dots, x_n$
and not on the particular realization of this law. It would be also
appropriate to write $\chi(\mu_{x_1, \dots, x_n})$.

3.1.4. Relation to Connes' problem. Note that there is no a priori
reason for $\Gamma(x_1, \dots, x_n; k, l, \varepsilon)$ to be non-empty. Connes has
posed a question in [Con76] of whether every II$_1$ factor can be
embedded into an ultrapower of the hyperfinite II$_1$ factor. It is
not hard to see that his question is equivalent to the question of
whether, given $x_1, \dots, x_n$ in a von Neumann algebra $(A, \varphi)$ with
a trace, one has that for any $\varepsilon > 0$ and $l > 0$ there is a $k$ so that
$\Gamma(x_1, \dots, x_n; k, l, \varepsilon) \ne \emptyset$. This question is open even for $x_1, \dots, x_n$
elements of the group algebra of an arbitrary discrete group $\Gamma$.

3.2. Properties of free entropy. Voiculescu gave an explicit
formula for the free entropy of a single variable $x$ with law $\mu$:

$$\chi(x) = \iint \log|s - t|\,d\mu(s)\,d\mu(t) + C,$$

for a certain universal constant $C$.

Free entropy has a number of nice properties, related to freeness
and analogous to the properties of classical entropy; we list
a few, due to Voiculescu [Voi94]:

o If $x_1, \dots, x_n$ are free, then $\chi(x_1, \dots, x_n) = \chi(x_1) + \cdots + \chi(x_n)$.
Furthermore, if $\chi(x_1, \dots, x_n) = \chi(x_1) + \cdots + \chi(x_n)$ and this
quantity is finite, then $x_1, \dots, x_n$ are freely independent.

o $\chi(x_1, \dots, x_n, y_1, \dots, y_m) \le \chi(x_1, \dots, x_n) + \chi(y_1, \dots, y_m)$.

o $\chi(x_1, \dots, x_n)$ is maximal subject to $\sum_i \varphi(x_i^2) = n$ iff
$x_1, \dots, x_n$ is a free semicircular family and each $x_i$ satisfies
$\varphi(x_i^2) = 1$.

o If $s_1, \dots, s_n$ are free semicircular variables, freely independent
from the family $x_1, \dots, x_n$, then $W^*(x_1, \dots, x_n)$ embeds
into the ultrapower of the hyperfinite II$_1$ factor if and only
if $\chi(x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n) > -\infty$ for every $\delta > 0$. Thus
semicircular perturbations (i.e., "free Brownian motion")
have a regularization effect on free entropy.

To give but one example of the technical difficulties that working
with $\chi$ presents, one would be able to prove that

$$\chi(x_1, \dots, x_n, y_1, \dots, y_m) = \chi(x_1, \dots, x_n) + \chi(y_1, \dots, y_m)$$

if $(x_1, \dots, x_n)$ and $(y_1, \dots, y_m)$ are free families, provided that one
could argue that the lim sup in the definition of free entropy is a
limit.

3.2.1. Infinitesimal change of variables formula. We end the review
of free entropy by mentioning the change of variables formula
[Voi94].

Assume that $y_1, \dots, y_n$ are given as non-commutative power series
in $x_1, \dots, x_n$: $y_j = F_j(x_1, \dots, x_n)$. Assume moreover that the
multi-radius of convergence of $F_j$ is large enough to exceed the
norms of all $x_1, \dots, x_n$. Assume further that $x_j = G_j(y_1, \dots, y_n)$ for
some non-commutative power series $G_j$, and that similarly the
multi-radius of convergence of $G_j$ is large enough to exceed the
norms of $y_1, \dots, y_n$.

Let $M = W^*(x_1, \dots, x_n) = W^*(y_1, \dots, y_n)$, and let $\tau$ be the given
trace on $M$. Consider the derivation $\partial_j : \mathbb{C}[x_1, \dots, x_n] \to M \otimes M$
determined by

$$\partial_j(x_i) = \delta_{ji}\, 1 \otimes 1.$$

For example,

$$\partial_2(x_1 x_2^2 x_3 x_2) = x_1 \otimes x_2 x_3 x_2 + x_1 x_2 \otimes x_3 x_2 + x_1 x_2^2 x_3 \otimes 1.$$

Let

$$J(x_1, \dots, x_n) = \big(J_{ij}(x_1, \dots, x_n)\big)_{i,j=1}^{n} \in M_n(M \otimes M)$$

be the "Jacobian" of $F$: $J_{ij}(x_1, \dots, x_n) = \partial_i F_j(x_1, \dots, x_n)$. Then

$$\chi(y_1, \dots, y_n) = \chi(x_1, \dots, x_n) + n \log\big(|\det|(J(x_1, \dots, x_n))\big),$$

where $|\det|$ refers to the Kadison-Fuglede determinant

$$|\det|(J) = \exp\big(\tau_{M_{n \times n}(M \otimes M)}(\log|J|)\big).$$

Here $\tau_{M_{n \times n}(M \otimes M)}$ is the tensor product $\frac{1}{n}\mathrm{Tr} \otimes \tau \otimes \tau$ of the traces on
$M_{n \times n}$ and $M \otimes M$.

The explanation of this formula and the appearance of $J$ is that
the Jacobian of the transformation

$$(X_1, \dots, X_n) \mapsto (F_1(X_1, \dots, X_n), \dots, F_n(X_1, \dots, X_n)),$$

viewed as a map from $(M_{k \times k}^{sa})^n$ to itself, is naturally a matrix
in $M_{n \times n}(\mathrm{End}(M_{k \times k})) = M_{n \times n}(M_{k \times k} \otimes M_{k \times k})$, and is given by
$J(X_1, \dots, X_n)$.

3.3. Free entropy dimension. Voiculescu's original idea for 
defining free entropy dimension was to consider a kind of asymp- 
totic Minkowski dimension of the set of microstates. We present 
below an equivalent definition of K. Jung, which is based on pack- 
ing dimension instead. 

3.3.1. Packing and covering numbers and Minkowski dimension.
For a metric space $X$, let $P_\varepsilon(X)$ be the packing number of $X$; that
is, the maximal number of disjoint $\varepsilon$-balls that can be placed
inside $X$. Similarly, let $K_\varepsilon(X)$ be the covering number of $X$; that is,
the minimal number of $\varepsilon$-balls needed to cover $X$.

For a metric space $X$, the upper uniform packing dimension
and the upper uniform covering dimension are the same and are
defined as

$$\limsup_{\varepsilon \to 0} \frac{\log P_\varepsilon(X)}{|\log \varepsilon|} = \limsup_{\varepsilon \to 0} \frac{\log K_\varepsilon(X)}{|\log \varepsilon|}.$$

It is a theorem that if $X \subset \mathbb{R}^d$, then both of these numbers are the
same as the Minkowski dimension of $X$, which is given by

$$d - \liminf_{\varepsilon \to 0} \frac{\log \mathrm{Vol}\, N_\varepsilon(X)}{\log \varepsilon},$$

where $N_\varepsilon(X)$ denotes the tubular neighborhood of $X$ of radius $\varepsilon$.
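
To get a feeling for these quantities, here is a small sketch for the middle-thirds Cantor set in $\mathbb{R}$, an example of our own choosing. Instead of exact covering numbers by $\varepsilon$-balls it counts the grid intervals of length $3^{-j}$ that meet the set; this grid count differs from $K_\varepsilon$ only by a bounded factor, so it has the same logarithmic growth rate. Points are stored as integers (numerators over $3^{\mathrm{depth}}$) to avoid rounding issues.

```python
from itertools import product
from math import log


def cantor_points(depth):
    """Left endpoints of the level-`depth` Cantor intervals, as integer numerators over 3**depth."""
    return [
        sum(digit * 3 ** (depth - 1 - i) for i, digit in enumerate(digits))
        for digits in product((0, 2), repeat=depth)
    ]


def box_count(points, depth, j):
    """Number of grid intervals of length 3**(-j) containing at least one of the points."""
    cell = 3 ** (depth - j)
    return len({p // cell for p in points})


depth = 10
points = cantor_points(depth)
# Ratio log(count) / |log(epsilon)| for epsilon = 3**(-j).
estimates = [log(box_count(points, depth, j)) / log(3 ** j) for j in range(1, 9)]
print(estimates[-1], log(2) / log(3))
```

At scale $3^{-j}$ exactly $2^j$ grid intervals are occupied, so every ratio equals $\log 2 / \log 3$, the familiar dimension of the Cantor set.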

3.3.2. Free entropy dimension. Let $x_1, \dots, x_n \in (A, \varphi)$ be
self-adjoint. Then let

$$P_\delta(x_1, \dots, x_n) = \inf_{l, \varepsilon} \limsup_{k \to \infty} \frac{1}{k^2} \log P_\delta(\Gamma(x_1, \dots, x_n; k, l, \varepsilon)),$$

$$K_\delta(x_1, \dots, x_n) = \inf_{l, \varepsilon} \limsup_{k \to \infty} \frac{1}{k^2} \log K_\delta(\Gamma(x_1, \dots, x_n; k, l, \varepsilon)).$$

Then K. Jung proved the following theorem [Jun02]:

Theorem 3.1. One has

$$\limsup_{\delta \to 0} \frac{P_\delta(x_1, \dots, x_n)}{|\log \delta|} = \limsup_{\delta \to 0} \frac{K_\delta(x_1, \dots, x_n)}{|\log \delta|}.$$

Moreover, if $s_1, \dots, s_n$ are free semicircular variables, free from
$x_1, \dots, x_n$, then

$$\limsup_{\delta \to 0} \frac{P_\delta(x_1, \dots, x_n)}{|\log \delta|} = n - \liminf_{\delta \to 0} \frac{\chi(x_1^\delta, \dots, x_n^\delta : s_1, \dots, s_n)}{\log \delta^{1/2}},$$

where $x_j^\delta = x_j + \sqrt{\delta}s_j$.

The value of any of these limits is then by definition called the
free entropy dimension $\delta_0(x_1, \dots, x_n)$.

Here $\chi(x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n : s_1, \dots, s_n)$ is the free entropy of
$x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n$ in the presence of $s_1, \dots, s_n$; it is a technical
modification of the free entropy $\chi(x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n)$. Very
roughly, the value of $\chi(x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n)$ is the asymptotic
logarithmic volume of a $\delta^{1/2}$-tubular neighborhood of the set of
microstates for $x_1, \dots, x_n$. Thus the number

$$n - \liminf_{\delta \to 0} \frac{\chi(x_1^\delta, \dots, x_n^\delta : s_1, \dots, s_n)}{\log \delta^{1/2}}$$

is a kind of asymptotic Minkowski dimension of the set of
microstates. This was the original definition of free entropy
dimension given by Voiculescu.

We finish this section with an example.

Let $x_1, \dots, x_n$ be free semicircular variables. Then
$x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n$ are also semicircular. In fact

$$\chi(x_1 + \sqrt{\delta}s_1, \dots, x_n + \sqrt{\delta}s_n : s_1, \dots, s_n) \ge \chi(x_1, \dots, x_n) > -\infty.$$

It follows that $\delta_0(x_1, \dots, x_n) = n$. In particular, the free group
factor $L(\mathbb{F}(n))$ can be generated by a family with free entropy
dimension $n$.

3.4. Properties of free entropy dimension. The theory of free 
entropy dimension has found a number of spectacular applica- 
tions to von Neumann algebra theory. For example, Voiculescu 
used free entropy dimension to prove that free group factors do 
not have Cartan subalgebras; soon thereafter, L. Ge gave a proof 
that free group factors are prime, i.e., cannot be written as tensor 
products of infinite-dimensional von Neumann algebras. 

One of the main remaining questions about free entropy dimension
is the extent to which $\delta_0(x_1, \dots, x_n)$ depends on the elements
$x_1, \dots, x_n$. Voiculescu asked if $\delta_0(x_1, \dots, x_n)$ is an invariant of the
von Neumann algebra generated by $x_1, \dots, x_n$, taken with a fixed
trace. Since $L(\mathbb{F}(n))$ has a generating family with free entropy
dimension equal to $n$, a positive answer to this question would
imply non-isomorphism of free group factors.

3.4.1. Invariance of $\delta_0$. Voiculescu proved that $\delta_0(x_1, \dots, x_n)$
depends only on the restriction of the trace to the algebra
generated by $x_1, \dots, x_n$. In particular, if $\Gamma$ is a discrete group and
$x_1, \dots, x_n \in \mathbb{C}\Gamma$ are self-adjoint generators of the group algebra,
then $\delta_0(x_1, \dots, x_n)$ depends only on the group. This invariant
seems to be related to the $L^2$-cohomology of $\Gamma$; see below.

3.4.2. Free entropy dimension for a single variable. Voiculescu
proved that if $X$ has law $\mu$, then

$$\delta_0(X) = 1 - \sum_{t \text{ an atom of } \mu} \mu(\{t\})^2.$$

In particular, notice that $\delta_0$ is an invariant of the von Neumann
algebra (with a fixed trace) generated by $X$.
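
The single-variable formula is straightforward to evaluate once the masses of the atoms of $\mu$ are known. The function below is an illustrative helper of our own (not taken from the literature); it takes the list of atom masses and returns $1 - \sum_t \mu(\{t\})^2$ in exact rational arithmetic.

```python
from fractions import Fraction


def free_entropy_dimension_single(atom_masses):
    """Return 1 - sum of squared atom masses for a probability law on the real line."""
    masses = [Fraction(m) for m in atom_masses]
    if any(m <= 0 for m in masses):
        raise ValueError("atom masses must be positive")
    if sum(masses) > 1:
        raise ValueError("the total mass of the atoms cannot exceed 1")
    return 1 - sum(m * m for m in masses)


# A law with no atoms (e.g. the semicircle law): dimension 1.
print(free_entropy_dimension_single([]))
# A projection of trace 1/2, whose law is (delta_0 + delta_1) / 2: dimension 1/2.
print(free_entropy_dimension_single([Fraction(1, 2), Fraction(1, 2)]))
# A self-adjoint k x k matrix with k distinct eigenvalues, here k = 4: dimension 1 - 1/k.
print(free_entropy_dimension_single([Fraction(1, 4)] * 4))
```

The matrix example is consistent with the invariance statement: the answer depends only on the multiplicities of the eigenvalues, that is, on the algebra generated by $X$ together with its trace.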

3.4.3. Upper bounds on $\delta_0$. If $M$ satisfies any of the following
conditions, then $\delta_0(x_1, \dots, x_n) \le 1$ for any $x_1, \dots, x_n \in M$ generating
$M$:

(1) [Voi96] $M$ has a Cartan subalgebra, i.e., a maximal abelian
subalgebra $A$ so that $M = W^*(\{u \in M \text{ unitary} : uAu^* = A\})$.
Thus free group factors have no Cartan subalgebras.

(2) [Voi96] $M$ has a diffuse regular hyperfinite subalgebra: a
hyperfinite subalgebra $R$ so that $M = W^*(\{u \in M \text{ unitary} : uRu^* = R\})$.
This is the case, in particular, if $M = L(\Gamma)$
and $\Gamma$ has an infinite normal amenable subgroup. Thus
free group factors do not have diffuse regular hyperfinite
subalgebras.

(3) [Voi96] $M$ has property $\Gamma$: there is a sequence of unitaries
$u_n \in M$, so that $\tau(u_n) = 0$ but $\|u_n x - x u_n\|_2 \to 0$ for all
$x \in M$. Free group factors are non-$\Gamma$ by a classical result
of Murray and von Neumann.

(4) [Ge98] $M = M_1 \otimes M_2$ with $M_1$ and $M_2$ infinite-dimensional.
Thus free group factors are prime.

In particular, note that $M \not\cong L(\mathbb{F}_n) * N$ for any $N$ which can be
embedded into the ultrapower of the hyperfinite II$_1$ factor (e.g., $N = \mathbb{C}$
is already interesting).

There are other conditions assuring upper bounds on $\delta_0$; we
mention the work of K. Dykema [Dyk97], M. Stefan [Ste99] and
of Ge and Shen [GS00]. Upper estimates on $\delta_0$ turned out to be of
relevance also to the theory of type III factors [Shl00, Shl03b].
3.4.4. Lower bounds on $\delta_0$. K. Jung has proved the following
"hyperfinite monotonicity result" [Jun03]: let $M$ be a diffuse von Neumann
algebra, and assume that $M$ is embeddable in the ultrapower
of the hyperfinite II$_1$ factor. Then $\delta_0(x_1, \dots, x_n) \ge 1$ for any
generators $x_1, \dots, x_n$ of $M$.
Combined with the upper estimates, this shows that if $M$
satisfies any of the properties (1)-(4) above and is embeddable into
the ultrapower of the hyperfinite II$_1$ factor, then the value of $\delta_0$ is
1 on any set of generators. In particular, $\delta_0$ is an invariant of the
entire von Neumann algebra!

Jung has also computed $\delta_0$ for arbitrary generators of a hyperfinite
algebra [Jun03] (which is in general a direct sum of matrix
algebras and a diffuse hyperfinite von Neumann algebra) and once
again found that $\delta_0$ is an invariant of the von Neumann algebra
in that case.

3.5. Relation with $L^2$-Betti numbers. By [CS], for any generators
$(x_1, \dots, x_n)$ of a tracial algebra $(A, \tau)$ one has the inequality
relating $\delta_0$ to the $L^2$-Betti numbers of $A$:

$$\delta_0(A) = \delta_0(x_1, \dots, x_n) \le \beta_1^{(2)}(A, \tau) - \beta_0^{(2)}(A, \tau) + 1.$$

In particular, specializing to the case of the group algebra of a
discrete group $\Gamma$, we have that

$$\delta_0(\Gamma) \le b_1^{(2)}(\Gamma) - b_0^{(2)}(\Gamma) + 1,$$

where $b_j^{(2)}$ are the $L^2$-Betti numbers of the group.

The same combination of Betti numbers also occurs in Gaboriau's
work on cost of equivalence relations [Gab00, Gab02]: indeed
he proves that

$$b_1^{(2)}(\Gamma) - b_0^{(2)}(\Gamma) + 1 \le C(\Gamma),$$

where $C(\Gamma)$ is the cost of $\Gamma$. There are no known examples in
which equality does not hold.

It is curious that $C(\Gamma)$ measures the "optimal number of
generators" for an equivalence relation induced by $\Gamma$; on the other
hand, $\delta_0(x_1, \dots, x_n)$ is known to be $\le 1$ in many cases in which the
von Neumann algebra is "singly generated" [GP98].

One obstruction for the equality between $\delta_0(\Gamma)$ and
$b_1^{(2)}(\Gamma) - b_0^{(2)}(\Gamma) + 1$ is the fact that the latter quantity is insensitive to
the outcome of Connes' embedding question (if there is a
non-embeddable group, one can manufacture a non-embeddable group
with large Betti numbers by taking free products).

It is also possible to define a "relative" version of Voiculescu's
free entropy dimension for equivalence relations; one can obtain
an invariant of an equivalence relation in this way (see [Shl01,
Shl03a]).

Lecture 4. Non-microstates Approach to Free Entropy.

We have reviewed the microstates definition of free entropy in
the previous lecture. There are several difficulties connected with
that definition. The first is that the involvement of sets of
microstates makes the definition hard to work with technically; as
we saw, there are several properties of free entropy (such as
additivity for free families) that one expects to hold, but which one
is unable to prove because of such technical difficulties. Another
example of such acute difficulties arises when one deals with free
Fisher information. By analogy with the classical case, one wants
to define the free Fisher information $\Phi(x_1, \dots, x_n)$ by the formula

$$\Phi(x_1, \dots, x_n) = 2\frac{d}{d\varepsilon}\Big|_{\varepsilon = 0} \chi(x_1 + \sqrt{\varepsilon}s_1, \dots, x_n + \sqrt{\varepsilon}s_n),$$

where $s_1, \dots, s_n$ are free semicircular variables, free from
$(x_1, \dots, x_n)$. The definition works fine in the case that $n = 1$ (the
explicit formula for $\chi$ is essential), but it is not clear how to prove
that the derivative exists and that the definition makes sense in
the case $n > 1$.

The other point is that the definition of the microstates free
entropy subsumes existence of microstates, i.e., embeddability into
the ultrapower of the hyperfinite II$_1$ factor. A priori, it is not
clear why one should assume this for elements of an arbitrary
non-commutative tracial probability space (although of course if
Connes' embeddability question always has an affirmative answer,
this second point disappears).

Voiculescu [Voi98a] gave a new definition of free entropy, based
on an "infinitesimal" approach involving free Fisher information.
This new approach does not involve microstates and for this
reason the resulting entropy bears the name "non-microstates" or
"microstates-free". It is not at present known if the two
definitions (microstates and non-microstates) are the same, except in
the one-variable case; and indeed, showing this would give a
positive answer to Connes' embeddability question. Nonetheless, a
recent work by Biane, Capitaine and Guionnet [BCG03] shows
that the microstates free entropy is always smaller than the
non-microstates entropy.
To distinguish the two definitions, quantities related to the
non-microstates entropy are denoted by the same letter as their
microstates analogs, but with an asterisk; for example, the
non-microstates free entropy is $\chi^*$, and the corresponding free entropy
dimension is $\delta^*$.

4.1. A non-rigorous derivation of the non-microstates
definition. We begin with a (rigorous) consequence of the change of
variables formula for microstates entropy. We shall assume that
$x_1, \dots, x_n$ are in a non-commutative probability space $A$ with a
tracial positive linear functional $\tau$.

4.1.1. Infinitesimal change of variables. Let $P_1, \dots, P_n$ be
polynomials in $n$ indeterminates. Consider the change of variables

$$y_j^\varepsilon = x_j + \varepsilon P_j(x_1, \dots, x_n).$$

Then for $\varepsilon$ sufficiently small, this change of variables can be
inverted and $x_j$ can be expressed as a non-commutative power
series in terms of $y_1^\varepsilon, \dots, y_n^\varepsilon$, so that the multi-radius of
convergence of that power series exceeds the operator norms of $y_1^\varepsilon, \dots, y_n^\varepsilon$.
Thus one can apply the change of variables formula and express
$\chi(y_1^\varepsilon, \dots, y_n^\varepsilon)$ in terms of the free entropy $\chi(x_1, \dots, x_n)$ and the
logarithm of the Jacobian of our transformation. Expanding the value
of the logarithm of the Jacobian as a power series in $\varepsilon$ gives us the
infinitesimal change of variables formula [Voi97]:

$$\chi(y_1^\varepsilon, \dots, y_n^\varepsilon) = \chi(x_1, \dots, x_n) + \varepsilon \sum_{j=1}^{n} \tau \otimes \tau(\partial_j P_j) + O(\varepsilon^2).$$

4.1.2. Conjugate variables. Let us now assume that $\partial_j : L^2(M) \to
L^2(M) \otimes L^2(M)$, with $M = W^*(x_1, \dots, x_n)$, has the property that $1 \otimes 1$
is in the domain of $\partial_j^*$. Let $\xi_j = \partial_j^*(1 \otimes 1) \in L^2(M)$. The elements
$\xi_1, \dots, \xi_n$ are called conjugate variables to $(x_1, \dots, x_n)$ and satisfy

$$\langle \xi_j, Q \rangle = \langle \partial_j(Q), 1 \otimes 1 \rangle = \tau \otimes \tau(\partial_j(Q)),$$

for any polynomial $Q \in \mathbb{C}[x_1, \dots, x_n]$.
Then our infinitesimal change of variables formula becomes:

$$\chi(y_1^\varepsilon, \dots, y_n^\varepsilon) = \chi(x_1, \dots, x_n) + \varepsilon \sum_{j=1}^{n} \langle P_j, \xi_j \rangle + O(\varepsilon^2).$$

It turns out that conjugate variables are intimately connected
with free Brownian motion. If we let

$$x_j^\varepsilon = x_j + \sqrt{\varepsilon}s_j,$$

where $s_j$ are a free semicircular family, free from $x_1, \dots, x_n$, then
for any polynomial $Q$ in $n$ indeterminates one can prove that

$$\tau(Q(x_1^\varepsilon, \dots, x_n^\varepsilon)) = \tau\big(Q(x_1 + \tfrac{\varepsilon}{2}\xi_1, \dots, x_n + \tfrac{\varepsilon}{2}\xi_n)\big) + O(\varepsilon^2).$$

Thus perturbations by conjugate variables give an "approximation"
in law to free Brownian motion; note, however, that
while $x_j^\varepsilon$ no longer lies in $W^*(x_1, \dots, x_n)$, $x_j + \frac{\varepsilon}{2}\xi_j$ does lie in
$L^2(W^*(x_1, \dots, x_n))$.

Conjugate variables frequently exist. For example, if $x_1, \dots, x_n$
are a free semicircular family, then $\xi_j$ exist and in fact $\xi_j = x_j$,
$j = 1, \dots, n$. One can show that for any $x_1, \dots, x_n$ and any $\varepsilon > 0$,
conjugate variables to the family $(x_1 + \sqrt{\varepsilon}s_1, \dots, x_n + \sqrt{\varepsilon}s_n)$ always
exist. In fact, in this case

$$\xi_j = \frac{1}{\sqrt{\varepsilon}} E_{W^*(x_1 + \sqrt{\varepsilon}s_1, \dots, x_n + \sqrt{\varepsilon}s_n)}(s_j).$$

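4.1.3. Non-rigorous derivation of the formula for $\Phi(x_1, \dots, x_n)$.
Assume now that $(\xi_1, \dots, \xi_n)$ are conjugate variables to
$(x_1, \dots, x_n)$. Let $(s_1, \dots, s_n)$ be as before a free semicircular
system, free from $(x_1, \dots, x_n)$.
Recall that we want to define the free Fisher information by

$$\Phi(x_1, \dots, x_n) = 2\frac{d}{d\varepsilon}\Big|_{\varepsilon = 0} \chi(x_1 + \sqrt{\varepsilon}s_1, \dots, x_n + \sqrt{\varepsilon}s_n).$$

Since $\chi(x_1, \dots, x_n)$ depends only on the law of $x_1, \dots, x_n$, and since
the laws of $(x_1 + \sqrt{\varepsilon}s_1, \dots, x_n + \sqrt{\varepsilon}s_n)$ and $(x_1 + \frac{\varepsilon}{2}\xi_1, \dots, x_n + \frac{\varepsilon}{2}\xi_n)$
are the same up to higher orders in $\varepsilon$, one would expect that

$$\Phi(x_1, \dots, x_n) = 2\frac{d}{d\varepsilon}\Big|_{\varepsilon = 0} \chi\big(x_1 + \tfrac{\varepsilon}{2}\xi_1, \dots, x_n + \tfrac{\varepsilon}{2}\xi_n\big).$$

We now assume that $\xi_j$ are sufficiently nice functions of $x_1, \dots, x_n$
so that the infinitesimal change of variables applies. Thus

$$\chi\big(x_1 + \tfrac{\varepsilon}{2}\xi_1, \dots, x_n + \tfrac{\varepsilon}{2}\xi_n\big) = \chi(x_1, \dots, x_n) + \frac{\varepsilon}{2}\sum_{j=1}^{n} \langle \xi_j, \xi_j \rangle + O(\varepsilon^2).$$

Summarizing, we then expect that

$$\Phi(x_1, \dots, x_n) = \sum_{j=1}^{n} \|\xi_j\|_{L^2(M)}^2.$$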




4.1.4. Definition of $\Phi^*(x_1, \dots, x_n)$. This leads us to take the
non-rigorous formula for $\Phi$ as a definition of the non-microstates free
Fisher information:

Definition 4.1. [Voi98a] Let $(x_1, \dots, x_n)$ be a family of
non-commutative random variables in $(A, \tau)$. If conjugate variables
$(\xi_1, \dots, \xi_n)$ to this family exist, then we set

$$\Phi^*(x_1, \dots, x_n) = \sum_{j=1}^{n} \|\xi_j\|_2^2.$$

If the conjugate variables do not exist, we set $\Phi^*(x_1, \dots, x_n) = +\infty$.

Note that this definition does not involve microstates.

In the case of a single variable $x$, $\xi$ ends up being the restriction
of the Hilbert transform of the distribution of $x$ to the support
of this distribution. One can then compute that if $\mu_x$ is Lebesgue
absolutely-continuous, and $d\mu_x(t) = p(t)\,dt$, then

$$\Phi^*(x) = \frac{4\pi^2}{3}\int p(t)^3\,dt.$$

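4.1.5. Definition of $\chi^*$. Since $\Phi^*$ was supposed to be proportional
to the derivative of free entropy, one can recover free entropy from
the free Fisher information. The formula is

$$\chi^*(x_1, \dots, x_n) = \frac{1}{2}\int_0^\infty \Big(\frac{n}{1 + t} - \Phi^*(x_1^t, \dots, x_n^t)\Big)\,dt + \frac{n}{2}\log 2\pi e;$$

here as before $x_j^t = x_j + \sqrt{t}s_j$, and $(s_1, \dots, s_n)$ is a free semicircular
family, free from $(x_1, \dots, x_n)$.
Voiculescu proved that the function

$$t \mapsto \Phi^*(x_1^t, \dots, x_n^t)$$

is monotone decreasing and right semi-continuous in the sense
that

$$\lim_{s \downarrow t} \Phi^*(x_1^s, \dots, x_n^s) = \Phi^*(x_1^t, \dots, x_n^t).$$

It is an important open question if this function is always
continuous.

Furthermore, if $n = \sum_j \tau(x_j^2)$, then

$$\frac{n}{1 + t} \le \Phi^*(x_1^t, \dots, x_n^t) \le \frac{n}{t},$$

which implies that the integral defining $\chi^*$ makes sense and
converges to a value in $[-\infty, +\infty)$.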






4.2. Properties of $\chi^*$. As we mentioned in the foreword to this
section, the principal outstanding question in the theory of free
entropy is the question of when $\chi = \chi^*$. To this end there are two
results:

o [Voi98a] In the single-variable case, the two quantities are
equal: $\chi(x_1) = \chi^*(x_1)$;

o [BCG03] In general, the following inequality is satisfied:
$\chi(x_1, \dots, x_n) \le \chi^*(x_1, \dots, x_n)$.

The non-microstates definition turns out to be easier to work
with in some respects, but harder in others. One of the big
difficulties in the non-microstates framework is one's inability to
prove the change of variables formula. This difficulty is related
to our inability to handle the continuity properties of the
"non-commutative Hilbert transform", $(x_1, \dots, x_n) \mapsto (\xi_1, \dots, \xi_n)$, where
$(\xi_1, \dots, \xi_n)$ are the conjugate variables to $(x_1, \dots, x_n)$.

Nonetheless, $\chi^*$ has a lot of nice properties, for example (all of
these are from [Voi98a]):

o $\chi^*(x_1, \dots, x_n, y_1, \dots, y_m) = \chi^*(x_1, \dots, x_n) + \chi^*(y_1, \dots, y_m)$ if
the families $(x_1, \dots, x_n)$ and $(y_1, \dots, y_m)$ are free;

o $\chi^*(x_1, \dots, x_n, y_1, \dots, y_m) \le \chi^*(x_1, \dots, x_n) + \chi^*(y_1, \dots, y_m)$;

o $\chi^*(x_1, \dots, x_n)$ is maximal subject to $\sum_j \tau(x_j^2) = n$ if and only
if $x_1, \dots, x_n$ are free semicircular variables, and
$\tau(x_1^2) = \cdots = \tau(x_n^2) = 1$;

o If $s_1, \dots, s_n$ are free semicircular variables, free from the
family $x_1, \dots, x_n$, then for any $\varepsilon > 0$,
$\chi^*(x_1 + \sqrt{\varepsilon}s_1, \dots, x_n + \sqrt{\varepsilon}s_n) > -\infty$.

Comparing the last property of $\chi^*$ with the corresponding
property of $\chi$ explains why $\chi = \chi^*$ would imply a positive answer to
Connes' embeddability question.

4.3. Non-microstates free entropy dimension. Although we
don't know how to formulate the packing number definition of
free entropy dimension in the non-microstates approach, the
Minkowski dimension definition does have a straightforward
analog. We set

$$\delta^*(x_1, \dots, x_n) = n - \liminf_{\varepsilon \to 0} \frac{\chi^*(x_1^\varepsilon, \dots, x_n^\varepsilon)}{\log \varepsilon^{1/2}},$$

where as before $x_j^\varepsilon = x_j + \sqrt{\varepsilon}s_j$, and $s_1, \dots, s_n$ is a free semicircular
family, free from the family $x_1, \dots, x_n$.
It is tempting to formally apply L'Hôpital's rule in the definition
of $\delta^*$ and use the fact that $\frac{d}{d\varepsilon}\chi^*(x_1^\varepsilon, \dots, x_n^\varepsilon) = \frac{1}{2}\Phi^*(x_1^\varepsilon, \dots, x_n^\varepsilon)$. Thus
we write

$$\delta^\star(x_1, \dots, x_n) = n - \liminf_{\varepsilon \to 0} \varepsilon\,\Phi^*(x_1^\varepsilon, \dots, x_n^\varepsilon).$$

One can easily show that

$$\delta^\star(x_1, \dots, x_n) \ge \delta^*(x_1, \dots, x_n),$$

with no examples in which equality does not hold.

There are unfortunately precious few computations of $\delta^*$ or
$\delta^\star$, and much less is known about their properties than about the
properties of $\delta_0$. In particular, it is not known in general if $\delta^*$ or $\delta^\star$
depend only on the algebra generated by $x_1, \dots, x_n$, taken with its
trace.

We summarize what is known below:

o $\delta^*(x_1) = \delta^\star(x_1) = \delta_0(x_1) = 1 - \sum_{t \text{ an atom of } \mu_{x_1}} \mu_{x_1}(\{t\})^2$, where $\mu_{x_1}$ is the
law of $x_1$;

o $\delta^*(x_1, \dots, x_n, y_1, \dots, y_m) = \delta^*(x_1, \dots, x_n) + \delta^*(y_1, \dots, y_m)$ if
$(x_1, \dots, x_n)$ are free from $(y_1, \dots, y_m)$; the same is true for $\delta^\star$;

o [CS] If $x_1, \dots, x_n$ are generators of a tracial algebra $(A, \tau)$,
then $\delta^*(x_1, \dots, x_n) \le \delta^\star(x_1, \dots, x_n) \le \beta_1^{(2)}(A, \tau) - \beta_0^{(2)}(A, \tau) + 1$,
where $\beta_j^{(2)}(A, \tau)$ are the $L^2$-Betti numbers of $(A, \tau)$;

o [MS] If $x_1, \dots, x_n \in \mathbb{C}\Gamma$ are self-adjoint and generate the
group algebra of a discrete group $\Gamma$, then equality holds:
$\delta^\star(x_1, \dots, x_n) = \delta^*(x_1, \dots, x_n) = b_1^{(2)}(\Gamma) - b_0^{(2)}(\Gamma) + 1$, where
$b_j^{(2)}(\Gamma) = \beta_j^{(2)}(\mathbb{C}\Gamma)$ are the $L^2$-Betti numbers of $\Gamma$. In
particular, in this case $\delta^* = \delta^\star$ are algebraic invariants;

o [CS] If $W^*(x_1, \dots, x_n)$ has diffuse center, then
$\delta^*(x_1, \dots, x_n) \le 1$.